Domain Knowledge Data Science
In the recommender system example, the model might calculate the affinity that a user has towards a product.
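As a rough illustration of what "calculating an affinity" can look like, here is a minimal sketch that scores products for one user with cosine similarity. The feature vectors, the product names, and the helper names `affinity` and `rank_products` are all hypothetical and chosen for illustration; real recommender systems may compute affinity in very different ways.

```python
from math import sqrt


def affinity(user_vec, item_vec):
    """Cosine similarity between a user vector and a product vector.

    Assumption: both vectors describe the same (hypothetical) features,
    e.g. interest in outdoor gear, fiction, and travel.
    """
    dot = sum(u * p for u, p in zip(user_vec, item_vec))
    norm_u = sqrt(sum(u * u for u in user_vec))
    norm_p = sqrt(sum(p * p for p in item_vec))
    if norm_u == 0 or norm_p == 0:
        # No signal for this user or product, so report no affinity.
        return 0.0
    return dot / (norm_u * norm_p)


def rank_products(user_vec, catalog):
    """Return product names ordered from highest to lowest affinity."""
    scores = {name: affinity(user_vec, vec) for name, vec in catalog.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical data: features are (outdoors, fiction, travel).
user = [1.0, 0.0, 1.0]
catalog = {
    "tent": [1.0, 0.0, 1.0],
    "novel": [0.0, 1.0, 0.0],
    "boots": [1.0, 0.0, 0.0],
}
ranking = rank_products(user, catalog)
print(ranking)
```

The score itself is only a calculated quantity. Whether a high affinity actually translates into the business goal (a purchase, a longer session, a renewed subscription) is an assumption that has to come from knowledge of the domain.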
Consequently, this will broaden the number of fields whose needs data specialists can and will cover. With more companies entering the world of data, IoT, and the cloud, it is easier to see the benefits of hiring specialists to help them with their data science needs. Domain knowledge in data science is more important than ever.
The data scientist needs domain knowledge to clearly articulate the domain-specific assumptions that relate the problem goal to the calculated quantity. In large part this is for good reason, as domain knowledge is often domain specific and generalises to a lesser degree than programming skills or statistical knowledge. We can use the same definition in data science and say that domain knowledge is knowledge about the environment in which the data is processed to reveal the secrets of the data.
The role of domain knowledge in data science: without an extensive contextual understanding of the industry and sector, you will struggle to move effectively beyond event framing to more sophisticated forms of data science such as predictive analytics, condition monitoring, mapping, and conflict avoidance. It thus becomes obvious that domain knowledge is important both in the framework and in the body of a data science project. "FIFO vs. LIFO" is one of Kirill's tips and hacks for acquiring domain knowledge.
In some cases data scientists might also need strong subject matter expertise in addition to their technical skills, but in other cases, depending on the industry or on how the organization they work for is structured, that might not be the case. It will make the project faster, cheaper, and more likely to yield a ... Domain knowledge definitely helps in making better sense of the data and of the problem's context.
This episode talks about the importance of domain knowledge in data science.