Skip to main content

Breaking Data Silos – From Static Documents to Living Data

This Webinar demonstrates how Data Revival applies cutting edge computer vision and domain specific AI to unlock chemical “dark data” trapped in documents, patents, and historical notebooks, transforming it into structured, interoperable, AI ready assets that accelerate discovery and dissolve traditional data silos.

Abstract

In the physical sciences, critical R&D knowledge remains trapped in a vast array of static formats—ranging from handwritten lab notebooks and internal company reports to academic literature and patent filings. This fragmentation creates data silos that hinder innovation and prevent the deployment of modern digital tools.

In this webinar, Data Revival will demonstrate how advanced computer vision and domain-specific AI can transform this “dark data” into living, interoperable assets. We will showcase our specialized Optical Chemical Structure Recognition (OCSR) technology, which is uniquely capable of interpreting the full spectrum of chemical representations: standard printed molecules, complex Markush structures found in intellectual property, and hand-drawn structures from legacy notebooks.

Attendees will see this technology applied in real-world scenarios, including the creation of comprehensive reaction databases from thousands of Organic Process Research & Development (OPRD) papers , the systematic parsing of patent landscapes, and the automated extraction of historical notebook data to populate modern Electronic Lab Notebooks (ELNs). Join us to learn how we bridge the gap between static archives and AI-ready infrastructure.

Biography

Samuel Munday is the CEO and co-founder of the University of Southampton spin out Data Revival. He first became interested in scientific data management whilst building a predictive analytics platform for polymeric materials and realising that a lot of key data resided in a form incomprehensible to computers. This has led to the development of a series of tools for unstructured chemical data extraction and structuring, mainly used for turning hand written lab notebooks, patents and academic literature into structured searchable databases at scale. These services are now being used by large multinational chemical and pharmaceutical companies in areas as diverse as semiconductors, polymers and drug discovery.

Watch the recording

You can watch this recording via our You Tube channel.

What to do next

Related links:


About this page

If you would like to contribute content to the PSDI Knowledge Base or have feedback you would like to give on this guidance, please contact us.