What Is Machine Learning?
Machine learning is having a dramatic impact on the way
software is designed so that it can keep pace with busi-
ness change. Machine learning is so dramatic because it
helps you use data to drive business rules and logic. How is this
different? With traditional software development models, pro-
grammers wrote logic based on the current state of the business
and then added relevant data. However, business change has
become the norm. It is virtually impossible to anticipate what
changes will transform a market.
The value of machine learning is that it allows you to continually
learn from data and predict the future. This powerful set of algo-
rithms and models are being used across industries to improve
processes and gain insights into patterns and anomalies within
data.
But machine learning isn’t a solitary endeavor; it’s a team process
that requires data scientists, data engineers, business analysts,
and business leaders to collaborate. The power of machine learn-
ing requires a collaboration so the focus is on solving business.
Iterative Learning Machine Learning
Iterative learning Machine Learning from data Machine learning enables models to train on data sets before being deployed. Some machine learning models are online and contin- uously adapt as new data is ingested. On the other hand, other models, called offline machine learning models, are derived from machine learning algorithms but, once deployed, do not change. This Iterative learning Machine Learning process of online models leads to an improvement in the types of associations that are made between data elements. Due to their complexity and size, these patterns and associations could have easily been overlooked by human observation. After a model has been trained, these models can be used in real time to learn from data.Big Data
Defining Big Data Big data is any kind of data source that has at least one of four shared characteristics.- Extremely large Volumes of data
- The ability to move that data at a high Velocity of speed
- An ever-expanding Variety of data sources
- Veracity so that data sources truly represent truth.
Supervised learning
Supervised learning typically begins with an established set of
data and a certain understanding of how that data is classified.
Supervised learning is intended to find patterns in data that can
be applied to an analytics process. This data has labeled features
that define the meaning of data. For example, there could be mil-
lions of images of animals and include an explanation of what
each animal is and then you can create a machine learning appli-
cation that distinguishes one animal from another. By labeling
this data about types of animals, you may have hundreds of cat-
egories of different species. Because the attributes and the mean-
ing of the data have been identified, it is well understood by the
users that are training the modeled data so that it fits the details
of the labels. When the label is continuous, it is a regression; when
the data comes from a finite set of values, it known as classifica-
tion. In essence, regression used for supervised learning helps
you understand the correlation between variables. An example of
supervised learning is weather forecasting. By using regression
analysis, weather forecasting takes into account known historical
weather patterns and the current conditions to provide a predic-
tion on the weather.
The algorithms are trained using preprocessed examples, and at
this point, the performance of the algorithms is evaluated with
test data. Occasionally, patterns that are identified in a subset
of the data can’t be detected in the larger population of data. If
the model is fit to only represent the patterns that exist in the
training subset, you create a problem called overfitting. Overfit-
ting means that your model is precisely tuned for your training
data but may not be applicable for large sets of unknown data.
To protect against overfitting, testing needs to be done against
unforeseen or unknown labeled data. Using unforeseen data for
the test set can help you evaluate the accuracy of the model in
predicting outcomes and results. Supervised training models have
broad applicability to a variety of business problems, including
fraud detection, recommendation solutions, speech recognition,
or risk analysis.
Unsupervised learning
Unsupervised learning is best suited when the problem requires
a massive amount of data that is unlabeled. For example, social
media applications, such as Twitter, Instagram, Snapchat, and
16 Machine Learning For Dummies, IBM Limited Edition
These materials are © 2018 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
so on all have large amounts of unlabeled data. Understand-
ing the meaning behind this data requires algorithms that can
begin to understand the meaning based on being able to classify
the data based on the patterns or clusters it finds. Therefore,
the supervised learning conducts an Iterative learning Machine Learning process of analyz-
ing data without human intervention. Unsupervised learning is
used with email spam-detecting technology. There are far too
many variables in legitimate and spam emails for an analyst to
flag unsolicited bulk email. Instead, machine learning classifiers
based on clustering and association are applied in order to iden-
tify unwanted email.
0 Comments