PLM, Semantics Technology and Data Federation

by Oleg on June 7, 2012 · 9 comments

I’m in a deep technological mood these days. As you probably noticed, I’m attending Semantic Technology & Business conference in beautiful, but cold San Francisco. SemTech 2012 covers an interesting technological space that covers a variety of topics related to data, data management, big data, semantics, linked data and semantic web. So, the environment of the conference and some presentations made me think about some modern trends in data management related to data federation. It probably goes a bit beyond the technological level of this blog, but I found it interesting and insightful.

Distributed Data Architecture

Our world is getting more and more distributed. The time when you  was able to concentrate the data in a single computer and/or databases almost became a history. We are moving towards something bigger that can scale to the level of web. The following two examples show a potential role of semantic technologies in support of federated data environment:

Andrew Sunderland of Spry Inc presented enterprise data management options. Here is the interesting quote explaining his presentation:

Companies are looking for methods to quickly expose data sources for federated data access, while at the same time developing a robust, executable enterprise ontology. Data profiling tools can be leveraged to profile data sources and bootstrap ontologies and mappings. This talk will showcase how Spry is leveraging these tools to quickly expose data sources, while in developing an enterprise ontology

Another example is coming from FluidOpsTransformation of Enterprise Data Islands into Linked and Living Knowledge. Information Workbench environment coming from FluidOps. The discussion focus was on the transformation of enterprise data islands into linked and living knowledge and elaborates on the costs and benefits of managing information in a unified semantic space.

The following picture shows Information Workbench architecture and the role of semantic technologies to achieve the role of data unification.

Data Federation and Asymmetric Computing

I had a chance to attend the presentation of Bryan Thompson of Systap discussing the bigdata® architecture. His presentation was focused on the computing side of distributed data environment and federation. The following slide presents the role of RDF and graph as a unified model for heterogeneous data sources.

How is that related to PLM?

Now, you can ask me- how it is related to PDM and Engineering and Manufacturing world. Here is my take. IT infrastructure of manufacturing companies is extremely complicated these days. It includes existing data management and enterprise systems, content and document management vaults, unmanaged files and other data sources. Nowadays, cloud and web are coming as an additional data places companies target for data. The overall environment is global and distributed. Existing PLM systems are striving towards centralization of data into a singe data. The single database architecture might be not sufficient, cost of data transition might be too high, cloud and globalization is another dimension of complexity. Distributed and federated data management capable to scale to the level of web – logically and physically can be an interesting platform option to discover.

What is my conclusion? The history gave us many examples when large companies missed new technological trends, and it cost them to lose their leadership position. At the same time, we can see how web companies built their infrastructure and disrupted many existing domains. What will be the technological foundation that can support challenges manufacturing and engineering companies are facing today? What will be the role of semantic data technologies in the future of these systems is a right question to ask these days.

Best, Oleg

Share
  • MarcL

    Oleg – Nice post, glad someone is talking about this topic. This view of the future matches our take at Aras, that federated data arch for big data is increasingly the direction. We call it the connected cloud. Have explained how this relates to PLM at http://aras.com/plm/001206

    Hope you keep these forward looking blogs coming.

    MarcL
    http://www.aras.com

  • beyondplm

    Marc, thanks for the comment! I think it is quite important to focus on the network. As far as I understand Aras federation based on the proprietary technology Aras developed. However, the role of cloud is not clear. Can you elaborate please? Tnx, Oleg

  • Sandy Klausner

    Oleg, your observations are spot on, but I seriously question if the Semantic Web architecture can support the required contextualization requirements for managing this level of distributed data.  Many, many open base technological issues that the W3C specifications remain silent.

  • beyondplm

    Sandy, thanks for the comment! You concern makes sense. Semantic Web and sem tech technologies are making significant progress for the last 3-5 years. I'd encourage to see what I'm doing at Inforbix (http://www.inforbix.com). However, I'd love to know more about processes and contextualization. Can you share that? Best, Oleg

  • Sandy Klausner

    >I'd love to know more about processes and
    contextualization.

     

    To put my argument into perspective, let's first qualify
    the Semantic Web specifications challenges:

     

    = Standards fragmentation

      – Major
    vendors currently deviating from W3C's RDF/OWL/Linked Data specs

      – Walled
    gardens prevent the adoption of a common semantic infrastructure

     

    = Weaknesses of Linked Data

      – URIs are
    not friendly

      – Difficult
    to analyze

      -
    Bewildering

      – Verbose
    RDF/OWL

      – Nearly
    impossible to merge ontology

      -
    Triplestores are inefficient to process

      – Difficult
    to secure

      – XML
    format does not clearly depict or delineate between:

       
    + Concepts

       
    + Unstructured data

       
    + Structured data

     

    = Natural language-dependent

      – Words are
    open to multiple interpretations

      -
    Text-based graphs do not easily accommodate change

      – URIs do
    not robustly support foreign languages

     

    = Lacks machine intelligence to:

      – Develop
    Linked Data graphs

      – Analyze
    and link to external data

      – Harmonize
    metadata

     

    A scalable semantic infrastructure requires a format that
    can be built and utilized by both users and software agents

  • Sandy Klausner

    Sorry about the formatting of the last post, but it was entered as plain text!

  • beyondplm

    no problem. I made slight formatting of your comment to make it easy readable. 

  • beyondplm

    Sandy, thanks for commenting! I think you laid a very good list of topics to discuss in the context of technological choice for sem web or linked data. However, let me address it from a different angle. If I stay in the realm of SQL/RDBS, would my challenges will be similar? Fragmentation, inability to merge data models, lack machine intelligence, not friendly data modeling and data languages. The only argument that will stand on the side as a clear diff is that RDBMS has 20+ years proven technological validation. I can agree with that. However, it doesn't mean we don't need to experiment. So, I can see semantic tech experiment as something very exciting these days. Just my opinion. YMMV. Best, Oleg

  • Sandy Klausner

    We need more that “experimentation,” but a
    fundamental shift in the semantic infrastructure that can intrinsically support
    context-aware processing.  By
    placing the meaning of a concept into machine context, real time machine
    inference would be able to 'disambiguate intent.'  Disambiguation capability
    requires a graph and ontology distributed architecture that is not present in
    the Semantic Web.

Previous post:

Next post: