<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://lms.onnocenter.or.id/wiki/index.php?action=history&amp;feed=atom&amp;title=Data_Science_Strategy%3A_Definisi_%26_Scope</id>
	<title>Data Science Strategy: Definisi &amp; Scope - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://lms.onnocenter.or.id/wiki/index.php?action=history&amp;feed=atom&amp;title=Data_Science_Strategy%3A_Definisi_%26_Scope"/>
	<link rel="alternate" type="text/html" href="https://lms.onnocenter.or.id/wiki/index.php?title=Data_Science_Strategy:_Definisi_%26_Scope&amp;action=history"/>
	<updated>2026-04-20T04:44:40Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.45.1</generator>
	<entry>
		<id>https://lms.onnocenter.or.id/wiki/index.php?title=Data_Science_Strategy:_Definisi_%26_Scope&amp;diff=63133&amp;oldid=prev</id>
		<title>Onnowpurbo: Created page with &quot;Defining and Scoping a Data Science Strategy To understand the constituent parts of a data science strategy as well the strate- gy’s current and future significance, it’s...&quot;</title>
		<link rel="alternate" type="text/html" href="https://lms.onnocenter.or.id/wiki/index.php?title=Data_Science_Strategy:_Definisi_%26_Scope&amp;diff=63133&amp;oldid=prev"/>
		<updated>2021-04-07T02:24:49Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;Defining and Scoping a Data Science Strategy To understand the constituent parts of a data science strategy as well the strate- gy’s current and future significance, it’s...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;Defining and Scoping a Data&lt;br /&gt;
Science Strategy&lt;br /&gt;
To understand the constituent parts of a data science strategy as well the strate-&lt;br /&gt;
gy’s current and future significance, it’s worthwhile to look at some of the major&lt;br /&gt;
components on a high level. I then address each of these different parts in detail&lt;br /&gt;
throughout this book. But before that I need to make a short clarification of the&lt;br /&gt;
difference between a data science strategy and a data strategy.&lt;br /&gt;
On a high-level a data science strategy refers to the strategy you define with regards&lt;br /&gt;
to the entire data science investment in our company. It includes areas such as&lt;br /&gt;
overall data science objectives and strategic choices, regulatory strategies, data&lt;br /&gt;
need, competences and skillsets, data architecture, as well as how to measure the&lt;br /&gt;
outcome. The data strategy on the other hand, constitutes a subset of the data sci-&lt;br /&gt;
ence strategy, and is focused on outlining the strategic direction directly related to&lt;br /&gt;
the data. This includes areas such as data scope, data consent, legal, regulatory and&lt;br /&gt;
ethical considerations, storage collection frequency, data storage retention periods,&lt;br /&gt;
data management process and principles, and last, but not least; data governance.&lt;br /&gt;
Both strategies are needed in order to succeed with your data science investment&lt;br /&gt;
and should complement each other in order to work. The details of how to define&lt;br /&gt;
the data strategy is captured in part 2 of this book.&lt;br /&gt;
Objectives&lt;br /&gt;
If I ask about the objectives of a data science strategy, I’m asking whether there&lt;br /&gt;
are clear company objectives set and agreed on for any of the investments made in&lt;br /&gt;
data science. Are the objectives formulated in a way that makes them possible to&lt;br /&gt;
execute and measure success by? If not, then the objectives need to be reformu-&lt;br /&gt;
lated; this is a critically important starting point that must be completed properly&lt;br /&gt;
in order to succeed down the line.&lt;br /&gt;
Data science is a new field that holds amazing opportunities for companies to&lt;br /&gt;
drive a fundamental transformation, but it is complex and often not fully under-&lt;br /&gt;
stood by top management. You should consider whether the executive team’s&lt;br /&gt;
understanding of data science is sufficient to set the right targets or whether they&lt;br /&gt;
need to be educated and then guided in setting their target.&lt;br /&gt;
Whether you’re a manager or an employee in a small or large company, if you&lt;br /&gt;
want your company to succeed with its data science investment, don’t sit and&lt;br /&gt;
hope that the leadership of your company will understand what needs to be done.&lt;br /&gt;
If you’re knowledgeable in the area, make your voice heard or, if you aren’t, don’t&lt;br /&gt;
hesitate to accept help from those who have experience in the field.&lt;br /&gt;
26&lt;br /&gt;
PART 1 Optimizing Your Data Science InvestmentIf you decide to bring in external experts to assist you in your data science&lt;br /&gt;
strategizing, be sure to read up on the area yourself first, so that you can judge&lt;br /&gt;
the relevance of their recommendations for your business — the place where you&lt;br /&gt;
are the expert.&lt;br /&gt;
Approach&lt;br /&gt;
Taking the right initial approach is a fundamental part of your data science&lt;br /&gt;
strategy — it will determine whether your company takes the appropriate imple-&lt;br /&gt;
mentation and transformation approach for the data science investment. For&lt;br /&gt;
example, is the approach ambitious enough — or is it too ambitious, considering&lt;br /&gt;
time estimates related to available competence? Is there a clear business strategy&lt;br /&gt;
and expected value that the data science strategy can relate to? Taking the time to&lt;br /&gt;
think through the approach is sure to pay off because, if you don’t know where&lt;br /&gt;
you’re going, you are most unlikely to end up there.&lt;br /&gt;
Choices&lt;br /&gt;
The term choices here refers to the strategic choices necessary to drive the data&lt;br /&gt;
science transformation forward.&lt;br /&gt;
The strategy you create cannot be about doing everything. It’s equally important&lt;br /&gt;
to make strategic choices about what to do as it is to make decisions about what&lt;br /&gt;
not to do.&lt;br /&gt;
Decisions can also be distributed differently over time, because the choices can be&lt;br /&gt;
about starting with a particular business area or set of customers, learning from&lt;br /&gt;
that experience, and then continuing to include other areas or customers. The&lt;br /&gt;
same strategy applies to choices of data categories or types to focus on early rather&lt;br /&gt;
than later on as the company matures and capabilities expand.&lt;br /&gt;
Data&lt;br /&gt;
Defining a data strategy is a cornerstone of the data science strategy — it includes&lt;br /&gt;
all aspects related to the data, such as whether or not you understand the various&lt;br /&gt;
types of data you need to access in order to achieve your business objectives. Is the&lt;br /&gt;
data available? How will you approach data management and data storage? Have&lt;br /&gt;
you set priorities on the data? Have you identified and set data quality targets?&lt;br /&gt;
Another important aspect of data relates to data governance and security. Data&lt;br /&gt;
will be one of your most valuable assets going forward; how you treat it is funda-&lt;br /&gt;
mental to your company’s success.&lt;br /&gt;
CHAPTER 1 Framing Data Science Strategy&lt;br /&gt;
27Legal&lt;br /&gt;
Understanding the legal implications for the data you need in terms of access&lt;br /&gt;
rights, ownership, and usage models is vital. If you aren’t on top of this aspect&lt;br /&gt;
early on, you might find yourself in a situation where you cannot get hold of the&lt;br /&gt;
data you need for your business without breaking the law, or, even if you can get&lt;br /&gt;
hold of the data, you may realize that you cannot use it in the way you need in&lt;br /&gt;
order to fulfil your business objectives.&lt;br /&gt;
Laws and regulations related to data privacy stretch further than many people&lt;br /&gt;
think, and they keep changing in order to protect people’s data integrity. This is&lt;br /&gt;
good from a privacy perspective, but doesn’t always work well with data innova-&lt;br /&gt;
tion. Therefore, as a good investment, you should always stay informed about&lt;br /&gt;
laws and regulations related to the data needed for your business.&lt;br /&gt;
Ethics&lt;br /&gt;
Ethics, an area of growing importance, refers to the creation of clear ethical guide-&lt;br /&gt;
lines for how data science is approached in the company. Internally, this term&lt;br /&gt;
refers to securing a responsible approach to data usage and management when it&lt;br /&gt;
comes to preserving the data privacy of your customers or other stakeholders. One&lt;br /&gt;
way of protecting privacy is through anonymizing personal information in the&lt;br /&gt;
data sets.&lt;br /&gt;
Externally, insisting on the ethics of data science is vital when it comes to gaining&lt;br /&gt;
your customers’ trust in how you handle data. When machine learning or artificial&lt;br /&gt;
intelligence is introduced — especially when automation of decisions and preven-&lt;br /&gt;
tive actions are involved — it touches on another ethical perspective: the “explain-&lt;br /&gt;
ability” of algorithms. It refers to the idea that it must be possible to explain a&lt;br /&gt;
decision or action taken by a machine. Machine learning or artificial intelligence&lt;br /&gt;
cannot become an automated, black box execution by a machine. Humans must&lt;br /&gt;
stay in control to secure the transparency of AI algorithms and ensure that ethical&lt;br /&gt;
boundaries are kept.&lt;br /&gt;
Competence&lt;br /&gt;
Based on the objectives that are set, choices that are made, and approach that is&lt;br /&gt;
chosen, you must ensure that you put the right competence in place to execute on&lt;br /&gt;
your targets. Putting together an experienced and competent data science team is&lt;br /&gt;
easier said than done. Why is that? Well, you really need three main categories of&lt;br /&gt;
competencies, and the availability of experienced data scientists in the market is&lt;br /&gt;
now very low, simply because few data scientists have the sufficient experience&lt;br /&gt;
and because the demand for these types of competencies is very high.&lt;br /&gt;
28&lt;br /&gt;
PART 1 Optimizing Your Data Science InvestmentYou can’t get by with simply hiring only data scientists. Data engineers with a&lt;br /&gt;
genuine understanding of the data in focus is fundamental. Without good data&lt;br /&gt;
management, data scientists cannot perform their algorithmic magic. It’s as&lt;br /&gt;
simple as that.&lt;br /&gt;
Finally, you need to secure domain expertise for the targeted area, whether it’s a&lt;br /&gt;
vast business understanding or an exceptional operational understanding. It’s&lt;br /&gt;
absolutely crucial to have the domain experts working closely with the data&lt;br /&gt;
engineers and the data scientists to achieve productive data science teams in your&lt;br /&gt;
organization.&lt;br /&gt;
Infrastructure&lt;br /&gt;
When talking about infrastructure, it’s all about understanding what is needed in&lt;br /&gt;
terms of data architecture and applications in order to enable a productive and&lt;br /&gt;
innovative environment for your data science teams. It includes considering both&lt;br /&gt;
a development environment (a workspace where you innovate, develop, train, and&lt;br /&gt;
test new capabilities) and a production environment (a runtime environment&lt;br /&gt;
where you deploy and run your solutions).&lt;br /&gt;
Infrastructure includes all aspects, from how you’ll set up your data collection/&lt;br /&gt;
data ingest, anonymization, data storage, data management, and application layer&lt;br /&gt;
with tools for the analytics and ML/AI development and production environment.&lt;br /&gt;
It is impossible to identify and set up the perfect environment, especially because&lt;br /&gt;
the technology evolution in this area is moving very fast. However, a vital part of&lt;br /&gt;
the infrastructure setup is to avoid getting locked into a situation where you&lt;br /&gt;
become entirely dependent on a certain infrastructure vendor (hardware, software,&lt;br /&gt;
or cloud, for example). I don’t mean that you should only go for open source prod-&lt;br /&gt;
ucts, but I do mean that you have to think carefully which building blocks you’re&lt;br /&gt;
using and then make sure that they’re exchangeable in the long run, if needed.&lt;br /&gt;
Governance and security&lt;br /&gt;
Working actively with data governance and security will make sure you stay in&lt;br /&gt;
control of data usage at all times. It isn’t important only in terms of gaining your&lt;br /&gt;
customers’ trust, but it is in many cases also a necessity for following the law.&lt;br /&gt;
Keeping track of which data is collected, stored, and used for which use cases is a&lt;br /&gt;
minimum requirement for most types of data.&lt;br /&gt;
Overworking the area of governance and security will have an impact on your data&lt;br /&gt;
science productivity and innovation. A common mistake is to be overprotective&lt;br /&gt;
with regard to data usage, keeping all data locked in to a degree that nobody can&lt;br /&gt;
access what they need in order to do their job. Therefore, you should approach the&lt;br /&gt;
CHAPTER 1 Framing Data Science Strategy&lt;br /&gt;
29setup of data governance and security with a mindset of openness when it comes&lt;br /&gt;
to sharing data amongst employees within the organization. Lock the gates to&lt;br /&gt;
outsiders, but strive for an open-data approach internally, boosting collaboration,&lt;br /&gt;
reuse, and innovation.&lt;br /&gt;
Commercial/business models&lt;br /&gt;
As part of your company’s data science strategy, you need to consider whether you&lt;br /&gt;
only want to focus your efforts internally as a means of improving operational&lt;br /&gt;
efficiency or whether you have ambitions to utilize data science to improve your&lt;br /&gt;
commercial business models. Improving your business using data science will&lt;br /&gt;
absolutely expand your possibilities, both in improving current business as well as&lt;br /&gt;
helping you find new opportunities.&lt;br /&gt;
Tread carefully when commercializing data. If you haven’t transformed internally&lt;br /&gt;
first by implementing data-driven operations, you’ll likely be unable to fully&lt;br /&gt;
leverage a data science approach externally in the business perspective.&lt;br /&gt;
That doesn’t mean you need to implement and run data-driven operations&lt;br /&gt;
throughout the company, but such operations will be needed for the areas con-&lt;br /&gt;
nected to the new data-science-based business models and commercial offerings&lt;br /&gt;
you’re aiming to realize.&lt;br /&gt;
Measurements&lt;br /&gt;
Without measuring your success, how will you ever know whether you have&lt;br /&gt;
actually achieved your objectives? Or be able to prove that. Still, many companies&lt;br /&gt;
fail to think of measurements early on.&lt;br /&gt;
Measurements are needed not only from an internal operational efficiency&lt;br /&gt;
perspective but also to measure whether you have managed to deliver on the&lt;br /&gt;
promises made to customers. This is important regardless of whether the agreed-&lt;br /&gt;
on customer targets have been contracted or not. It should always be a priority for&lt;br /&gt;
you to know how your business is performing against your objectives. The feed-&lt;br /&gt;
back will give you all the information you need to determine where the business&lt;br /&gt;
stands, what needs to improve, and what has perhaps already been achieved.&lt;br /&gt;
Yes, establishing measurements early on is fundamental when it comes to secur-&lt;br /&gt;
ing continuous learning in your company, but it also shows customers that you&lt;br /&gt;
care about reaching your targets. However, don’t forget to think through the met-&lt;br /&gt;
rics structure you plan to use. It isn’t an easy task to identify and define the correct&lt;br /&gt;
set of metrics from the start. This is also something that needs to be reevaluated&lt;br /&gt;
over time, based on which measurements actually give you the insights and&lt;br /&gt;
feedback needed on what is going well — and what isn’t going so well.&lt;/div&gt;</summary>
		<author><name>Onnowpurbo</name></author>
	</entry>
</feed>