By Xiaodong He, Principal Researcher, Microsoft Research

For human beings, reading comprehension is a basic task, performed daily. As early as elementary school, we can read an article and answer questions about its key ideas and details.

But for AI, full reading comprehension is still an elusive goal, and a necessary one if we are going to measure and achieve generally intelligent AI. In practice, reading comprehension is needed in many real-world scenarios, including customer support, recommendations, question answering, dialog, and customer relationship management. It has incredible potential in situations such as helping a doctor quickly find important information amid thousands of documents, freeing up time for higher-value and potentially life-saving work.

Therefore, building machines able to perform machine reading comprehension (MRC) is of great interest. In search applications, machine comprehension will give a precise answer rather than a URL to a lengthy web page that contains the answer somewhere within it. Moreover, machine comprehension models can understand specific knowledge embedded in articles that usually cover narrow and specific domains, where the search data that algorithms depend on is sparse.

Microsoft is focused on machine reading and is currently leading a competition in the field. Multiple projects at Microsoft, including Deep Learning for Machine Comprehension, have also set their sights on MRC. Despite great progress, a key problem has been overlooked until recently: how to build an MRC system for a new domain.

Recently, several researchers from Microsoft Research AI, including Po-Sen Huang, Xiaodong He, and David Golub, an intern from Stanford University, developed a transfer learning algorithm for MRC to attack this problem. Their work will be presented at EMNLP 2017, a top natural language processing conference. This is a key step toward developing a scalable solution that extends MRC to a wider range of domains.

It is an example of the progress we are making toward a broader goal we have at Microsoft: creating technology with more sophisticated and nuanced capabilities. 'We're not just going to build a bunch of algorithms to solve theoretical problems. We're using them to solve real problems and testing them on real data,' said Rangan Majumder in the machine reading blog.

Currently, most state-of-the-art machine reading systems are built on supervised training data: they are trained end-to-end on examples that contain not only the articles but also manually labeled questions about the articles and the corresponding answers. With these examples, the deep learning-based MRC model learns to understand the questions and infer the answers from the article, a process that involves multiple steps of reasoning and inference.

However, for many domains or verticals, this supervised training data does not exist. For example, if we need to build a new machine reading system to help doctors find important information about a new disease, there may be many documents available, but no manually labeled questions about those articles or corresponding answers. The challenge is magnified both by the need to build a separate MRC system for each disease and by the rapidly growing volume of literature. It is therefore crucial to figure out how to transfer an MRC system to a new domain where no manually labeled questions and answers are available, but a body of documents exists.

Microsoft researchers developed a novel model, called a 'two-stage synthesis network' or SynNet, to address this critical need. In this approach, based on the supervised data available in one domain, the SynNet first learns a general pattern for identifying potential 'interestingness' in an article: key knowledge points, named entities, or semantic concepts that are usually the answers people ask about. Then, in the second stage, the model learns to form natural language questions around these potential answers, within the context of the article. Once trained, the SynNet can be applied to a new domain: it reads the documents in the new domain and generates pseudo questions and answers against them, forming the training data needed to build an MRC system for that domain, whether it concerns a new disease, a new company's employee handbook, or a new product manual.
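To make the overall workflow concrete, here is a minimal, purely illustrative Python sketch of that pseudo-labeling loop. The two learned stages are replaced with crude stand-ins (a heuristic answer picker and a templated question writer), and every function name here is hypothetical; the actual SynNet uses trained neural modules for both stages.

```python
import re
from typing import List, Tuple

def pick_candidate_answers(paragraph: str) -> List[str]:
    """Toy stand-in for the answer synthesis stage: grab capitalized
    spans and numbers as likely 'interesting' concepts."""
    return re.findall(r"[A-Z][a-zA-Z]+(?: [A-Z][a-zA-Z]+)*|\d[\d,.]*", paragraph)

def write_question(paragraph: str, answer: str) -> str:
    """Toy stand-in for the question synthesis stage: a fixed template
    instead of a learned, answer-conditioned question generator."""
    return f"What does the passage say about {answer}?"

def synthesize_training_data(new_domain_docs: List[str]) -> List[Tuple[str, str, str]]:
    """Generate pseudo (paragraph, question, answer) triples for a new
    domain; these triples would then be used to train an MRC model."""
    triples = []
    for paragraph in new_domain_docs:
        for answer in pick_candidate_answers(paragraph):
            triples.append((paragraph, write_question(paragraph, answer), answer))
    return triples

if __name__ == "__main__":
    docs = ["Aspirin was first isolated by Felix Hoffmann in 1897 at Bayer."]
    for p, q, a in synthesize_training_data(docs):
        print(q, "->", a)
```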

The idea of generating synthetic data to augment insufficient training data has been explored before. For example, for the task of translation, Rico Sennrich and colleagues presented a method for generating synthetic translations from real sentences to refine an existing machine translation system. However, unlike machine translation, tasks like MRC require synthesizing both questions and answers for an article. Moreover, while a question is a syntactically fluent natural language sentence, the answer is usually a salient semantic concept in the paragraph, such as a named entity, an action, or a number. Since the answer has a different linguistic structure than the question, it may be more appropriate to view answers and questions as two different types of data.

In our approach, we decompose the process of generating question-answer pairs into two steps: answer generation conditioned on the paragraph, and question generation conditioned on both the paragraph and the answer. We generate the answer first because answers are usually key semantic concepts, while a question can be viewed as a full sentence composed to inquire about that concept.
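Formally, this two-step decomposition corresponds to factoring the probability of a question-answer pair given the paragraph (writing p for paragraph, a for answer, q for question; the notation here is ours):

```latex
P(q, a \mid p) = P(a \mid p)\, P(q \mid p, a)
```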

The SynNet is trained to synthesize the answer and the question for a given paragraph. The first stage of the model, an answer synthesis module, uses a bi-directional long short-term memory (LSTM) network to predict inside-outside-beginning (IOB) tags on the input paragraph, marking out key semantic concepts that are likely answers. The second stage, a question synthesis module, uses a uni-directional LSTM to generate the question while attending over embeddings of the paragraph words and their IOB tags. Although multiple spans in the paragraph could be identified as potential answers, we pick one span when generating the question.
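A minimal PyTorch sketch of how such a two-stage model could look is below. It follows the description above (a bi-directional LSTM tagging IOB labels over the paragraph, and a uni-directional LSTM decoding the question while attending over paragraph representations that encode the chosen answer span), but all layer sizes, names, and design details are illustrative assumptions, not the authors' exact architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AnswerSynthesisModule(nn.Module):
    """Stage 1: tag each paragraph token with an IOB label
    (O = outside, B = begin answer, I = inside answer)."""
    def __init__(self, vocab_size, emb_dim=128, hidden=128, num_tags=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.tagger = nn.Linear(2 * hidden, num_tags)

    def forward(self, paragraph_ids):                     # (batch, plen)
        enc, _ = self.bilstm(self.embed(paragraph_ids))   # (batch, plen, 2*hidden)
        return self.tagger(enc)                           # (batch, plen, num_tags) logits

class QuestionSynthesisModule(nn.Module):
    """Stage 2: generate the question token by token with a
    uni-directional LSTM, attending over paragraph states that
    also encode the chosen answer span via IOB tag embeddings."""
    def __init__(self, vocab_size, emb_dim=128, hidden=256, num_tags=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.tag_embed = nn.Embedding(num_tags, emb_dim)
        self.encoder = nn.LSTM(2 * emb_dim, hidden, batch_first=True, bidirectional=True)
        self.decoder = nn.LSTMCell(emb_dim + 2 * hidden, hidden)
        self.attn = nn.Linear(hidden, 2 * hidden)
        self.out = nn.Linear(hidden, vocab_size)

    def forward(self, paragraph_ids, iob_ids, question_ids):
        # Encode the paragraph together with the IOB tags marking the answer.
        p = torch.cat([self.embed(paragraph_ids), self.tag_embed(iob_ids)], dim=-1)
        memory, _ = self.encoder(p)                        # (batch, plen, 2*hidden)

        batch = paragraph_ids.size(0)
        h = p.new_zeros(batch, self.decoder.hidden_size)
        c = p.new_zeros(batch, self.decoder.hidden_size)
        logits = []
        for t in range(question_ids.size(1)):              # teacher forcing over the question
            scores = torch.bmm(memory, self.attn(h).unsqueeze(-1)).squeeze(-1)
            context = torch.bmm(F.softmax(scores, dim=-1).unsqueeze(1), memory).squeeze(1)
            step_in = torch.cat([self.embed(question_ids[:, t]), context], dim=-1)
            h, c = self.decoder(step_in, (h, c))
            logits.append(self.out(h))
        return torch.stack(logits, dim=1)                  # (batch, qlen, vocab)
```

At generation time in a new domain, the highest-scoring answer span from the first module would be fed (as IOB tags) to the second module, which then decodes a question token by token, for example greedily or with beam search.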

Two examples of generated questions and answers from articles are illustrated below:

Using the SynNet, we were able to get more accurate results on a new domain without any additional training data, approaching the performance of a fully supervised MRC system.

SynNet, trained on SQuAD (Wikipedia articles), performs almost as well on the NewsQA domain (news articles) as a system fully trained on NewsQA.

The SynNet is like a teacher who, based on her experience in previous domains, creates questions and answers from articles in the new domain and uses these materials to teach her students to perform reading comprehension there. Accordingly, Microsoft researchers also developed a set of neural machine reading models, including the recently developed ReasoNet, which has shown a lot of promise; these are like the students who learn from the teaching materials to answer questions based on the article.

To our knowledge, this is the first attempt at domain transfer for MRC. We look forward to developing scalable solutions that rapidly expand the capability of MRC and unlock the game-changing potential of machine reading!


Original document: https://www.microsoft.com/en-us/research/blog/transfer-learning-machine-reading-comprehension/
