-
LifeSciences Design Platform
Our long-term goal is to build a “LifeSciences Design Platform” that extracts both meaning and actionable knowledge from the mountains of Open Data available in the life sciences space. Here are a couple of diagrams that outline our vision for an integrated, full-lifecycle view of that Open Data. We believe we can integrate…
-
Customized Synthetic Data
In this article we cover how DataSDR generates customized Synthetic Data to improve your software development and debugging processes. You can download a paper we presented at the PharmaSUG 2021 event. We provide our clients with several types of data: “Open Data” vs. “Green Data” vs. “Red Data.”* “Open Data” refers to data freely…
-
Using Open Data to Improve LLMs
In this article we’ll explore how Open Data can be used to improve Large Language Models (“LLMs”). We wrote a full paper on using our Open Data Repository to train an LLM; please download it here. First of all, we believe that using Open Data provides your models with a massive amount of “ground…