1 Natural Language Processing-Intro
1 Natural Language Processing-Intro
1 Natural Language Processing-Intro
Processing
Introduction
• Natural language processing (NLP) is an area of computer science and
artificial intelligence concerned with the interactions between
computers and human (natural) languages, in particular how to
program computers to process and analyze large amounts of natural
language data.
• The history of natural language processing generally started in the 1950s, although
work can be found from earlier periods. In 1950, Alan Turing published an article
titled "Computing Machinery and Intelligence" which proposed what is now called
the Turing test as a criterion of intelligence.
• NLP is not a new science, the technology is rapidly advancing thanks to an increased
interest in human-to-machine communications, plus an availability of big data,
powerful computing and enhanced algorithms.
• As a human, we may speak and write in English, Hindi, Spanish or Chinese, But a
computer’s native language – known as machine code or machine language – is
largely incomprehensible to people.
• At your device’s lowest levels, communication occurs not with words but through
millions of zeros and ones that produce logical actions.
Why is NLP important?
• Human language is astoundingly complex and diverse. We express ourselves in infinite ways, both
verbally and in writing. Not only are there hundreds of languages and dialects, but within each
language is a unique set of grammar and syntax rules, terms and slang.
• When we write, we often misspell or abbreviate words, or omit punctuation. When we speak, we
have regional accents, and we mumble, stutter and borrow terms from other languages.
• While supervised and unsupervised learning, and specifically deep learning, are now widely used
for modelling human language, there’s also a need for syntactic and semantic understanding and
domain expertise that are not necessarily present in these machine learning approaches.
• NLP is important because it helps resolve ambiguity in language and adds useful numeric structure
to the data for many downstream applications, such as speech recognition or text analytics.
How does NLP work?
To Breaking down the elemental pieces of language
• Natural language processing includes many different techniques for interpreting
human language, ranging from statistical and machine learning methods to rules-
based and algorithmic approaches. We need a broad array of approaches because
the text- and voice-based data varies widely, as do the practical applications.
• In general terms, NLP tasks break down language into shorter, elemental pieces,
try to understand relationships between the pieces and explore how the pieces
work together to create meaning.
How does NLP work?
These underlying tasks are often used in higher-level NLP capabilities, such as:
Content categorization A linguistic-based document summary, including search and
indexing, content alerts and duplication detection.
Topic discovery and modelling Accurately capture the meaning and themes in text
collections, and apply advanced analytics to text, like optimization and forecasting.
Contextual extraction Automatically pull structured information from text-based sources.
Sentiment analysis Identifying the mood or subjective opinions within large amounts of text,
including average sentiment and opinion mining.
Speech-to-text and text-to-speech conversion Transforming voice commands into written
text, and vice versa.
Document summarization Automatically generating synopses of large bodies of text.
Machine translation. Automatic translation of text or speech from one language to another.
How computers make sense of textual data
NLP and Text analytics
• Natural language processing goes hand in hand with text analytics, which counts,
groups and categorizes words to extract structure and meaning from large
volumes of content. Text analytics is used to explore textual content and derive
new variables from raw text that may be visualized, filtered, or used as inputs to
predictive models or other statistical methods.
• NLP and text analytics are used together for many applications, including:
• Investigative discovery. Identify patterns and clues in emails or written reports to
help detect and solve crimes.
• Subject-matter expertise. Classify content into meaningful topics so you can take
action and discover trends.
• Social media analytics. Track awareness and sentiment about specific topics and
identify key influencers.
How computers make sense of textual data
Everyday NLP Examples
There are many common and practical applications of NLP in everyday lives.
Virtual assistants like Alexa or Siri.
Spam filtering, a statistical NLP technique that compares the words in spam
to valid emails to identify junk mail.
Phoneme - The smallest unit of sound that has meaning within a language