Breaking Data Silos – From Static Documents to Living Data
This Webinar demonstrates how Data Revival applies cutting edge computer vision and domain specific AI to unlock chemical “dark data” trapped in documents, patents, and historical notebooks, transforming it into structured, interoperable, AI ready assets that accelerate discovery and dissolve traditional data silos.
Abstract
In the physical sciences, critical R&D knowledge remains trapped in a vast array of static formats—ranging from handwritten lab notebooks and internal company reports to academic literature and patent filings. This fragmentation creates data silos that hinder innovation and prevent the deployment of modern digital tools.
In this webinar, Data Revival will demonstrate how advanced computer vision and domain-specific AI can transform this “dark data” into living, interoperable assets. We will showcase our specialized Optical Chemical Structure Recognition (OCSR) technology, which is uniquely capable of interpreting the full spectrum of chemical representations: standard printed molecules, complex Markush structures found in intellectual property, and hand-drawn structures from legacy notebooks.
Attendees will see this technology applied in real-world scenarios, including the creation of comprehensive reaction databases from thousands of Organic Process Research & Development (OPRD) papers , the systematic parsing of patent landscapes, and the automated extraction of historical notebook data to populate modern Electronic Lab Notebooks (ELNs). Join us to learn how we bridge the gap between static archives and AI-ready infrastructure.
Biography
Samuel Munday is the CEO and co-founder of the University of Southampton spin out Data Revival. He first became interested in scientific data management whilst building a predictive analytics platform for polymeric materials and realising that a lot of key data resided in a form incomprehensible to computers. This has led to the development of a series of tools for unstructured chemical data extraction and structuring, mainly used for turning hand written lab notebooks, patents and academic literature into structured searchable databases at scale. These services are now being used by large multinational chemical and pharmaceutical companies in areas as diverse as semiconductors, polymers and drug discovery.
Watch the recording
You can watch this recording via our You Tube channel.
What to do next
- Watch another Webinar from our list
- Take a look at our self-paced learning
- Try our Tutorials
- Find current and past in-person training opportunities on the Events page on the PSDI website
Related links:
- Creator: Cerys Willoughby
- Last modified date: 2026-03-20
- License: CC-BY-4.0
- Citation: Please cite: Cerys Willoughby, Breaking Data Silos – From Static Documents to Living Data, https://guidance.psdi.ac.uk/docusaurus-pages/docs/training/webinars/mlips/, PSDI (modified 2026-03-20)
If you would like to contribute content to the PSDI Knowledge Base or have feedback you would like to give on this guidance, please contact us.