A blog by Oleg Shilovitsky
Information & Comments about Engineering and Manufacturing Software

PLM: Manufacturing Big Data Ngram Dream?

Oleg
3 February, 2014 | 2 min for reading


A Daily Beast article with a funny title, Why Big Data Doesn’t Live up to the Hype, caught my attention this weekend. I read the article and, during my long travel over the weekend, skimmed the book it mentions: Uncharted: Big Data as a Lens on Human Culture by Erez Aiden and Jean-Baptiste Michel. The authors were instrumental in creating the Google Ngram Viewer.

The Google Ngram Viewer is a phrase-usage graphing tool developed by Jon Orwant and Will Brockman of Google, and charts the yearly count of selected n-grams (letter combinations)[n] or words and phrases,[1][2] as found in over 5.2 million books digitized by Google Inc (up to 2008).[3][4] The words or phrases (or ngrams) are matched by case-sensitive spelling, comparing exact uppercase letters,[2] and plotted on the graph if found in 40 or more books during each year (of the requested year-range).[5] The Ngram tool was released in mid-December 2010.[1][3]

The word-search database was created by Google Labs, based originally on 5.2 million books, published between 1500 and 2008, containing 500 billion words[6] in American English, British English, French, German, Spanish, Russian, Hebrew, and Chinese.[1] Italian words are counted by their use in other languages. A user of the Ngram tool has the option to select among the source languages for the word-search operations.[7]

Researchers have analysed the Google Ngram database of books written in American or British English discovering interesting results. Amongst them, they found correlations between the emotional output and significant events in the 20th century such as the World War II.[8]
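To make the description above more concrete, here is a minimal sketch of the counting idea: count the yearly occurrences of a phrase with case-sensitive matching, and keep only the years in which enough books contain it. This is only my illustration of the quoted description, not Google's implementation. The function names (`ngrams`, `yearly_counts`) and the tiny corpus are made up for the example.

```python
from collections import defaultdict


def ngrams(text, n):
    """Split text on whitespace and return its word n-grams as strings."""
    words = text.split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def yearly_counts(books, phrase, min_books=40):
    """Count occurrences of `phrase` per year across (year, text) pairs.

    Matching is case-sensitive. A year is kept only if at least
    `min_books` books from that year contain the phrase.
    """
    n = len(phrase.split())
    occurrences = defaultdict(int)  # year -> total occurrences of the phrase
    book_hits = defaultdict(int)    # year -> number of books containing it
    for year, text in books:
        hits = ngrams(text, n).count(phrase)
        if hits:
            occurrences[year] += hits
            book_hits[year] += 1
    return {
        year: occurrences[year]
        for year in sorted(occurrences)
        if book_hits[year] >= min_books
    }


# Hypothetical toy corpus: (publication year, book text)
books = [
    (2007, "big data is everywhere"),
    (2007, "Data and love"),          # "Data" does not match "data"
    (2008, "all you need is data"),
    (2008, "data about data"),
]

# The threshold is lowered from 40 so the toy corpus produces output.
print(yearly_counts(books, "data", min_books=1))
print(yearly_counts(books, "data", min_books=2))
```

The book threshold is a parameter here so you can see how it hides rare phrases: with `min_books=2`, the year 2007 drops out because only one book from that year contains lowercase "data".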

If you have never tried the Ngram Viewer, you should. Navigate here and try it out. You can find some interesting trends. Here is my funny example: the “data” trend is eclipsing “love”. Does it mean something? I’m not sure, but it is funny…


Google certainly has the power to handle projects this large. Everybody is trying to collect data these days, and you can see some very interesting examples. The ambitions of CAD and PLM companies do not go that far… yet. Here is an idea for somebody with budget and free time: collect product lifecycle information related to the manufacturing industry, suppliers, material trends and consumer behavior. More and more data is becoming publicly available on the web. Collecting and classifying this information could help us explore future demands and opportunities.

What is my conclusion? In data we trust. Data is a very powerful argument, and we use it frequently. With the globalization of the manufacturing industry and the ambition to discover future trends and opportunities in manufacturing and the supply chain, I can see the collection of publicly available manufacturing data as a key to the unknown unknowns. Just a crazy idea and my thoughts… Happy Monday!

Best, Oleg
