• LifeSciences Design Platform

    ·

    LifeSciences Design Platform

    Our long-term goal is to build a “LifeSciences Design Platform” that extracts both meaning as well as actionable knowledge from the mountains of Open Data available in the life sciences space. Here are a couple of diagrams that outline our vision for an integrated, full-lifecycle view of that Open Data. We believe we can integrate…

  • Customized Synthetic Data

    ·

    ,

    Customized Synthetic Data

    In this article we cover how DataSDR generates customized Synthetic Data to improve your software development and debugging processes. You can download a paper we presented at the PharmaSUG 2021 event. There are several types of data we provide our clients: “Open Data” vs. “Green Data” vs. “Red Data.”* “Open Data” refers to data freely…

  • Using Open Data to Improve LLMs

    ·

    ,

    Using Open Data to Improve LLMs

    In this article we’ll explore how Open Data can be used to improve Large Language Models (“LLMs”). We wrote a full paper on using our Open Data Repository to train an LLM model, please download it here. First of all, we believe that using Open Data provides your models with a massive amount of “ground…