Map of Life
Content users
Video Player is loading.
Current Time 0:15
Duration 0:20
Loaded: 100.00%
Stream Type LIVE
Remaining Time 0:05
 
1x
    • Chapters
    • descriptions off, selected
    • captions off, selected
    • default, selected
    Science
    00

    Digital clear glass for the future of medicine

    When Galileo Galilei made his groundbreaking astronomical  discoveries four centuries ago, the technology helping him disrupt our understanding of the world had been invented in Venice a few years earlier. There, the famous Murano glassblowers had developed clear glass, which allowed for the construction of lenses for telescopes and microscopes – instruments which helped rewrite everything from astronomy to biology. The Novartis digital research platform data42 hopes to be as groundbreaking.

    Text by Goran Mijuk, illustrations by Philip Bürli, photos by Björn Myhre

    scroll-down
    Home
    en
    de
    zh
    jp
    Share
    Share icon
    content-image
    Enter fullscreen

    Connecting the medical dots to create a holistic understanding of human biology.

    arrow-rightMap of Life
    arrow-rightContent users

    Back in the summer of 2019, I met with Achim Plueckebaum, Peter Speyer and Pascal Bouquet, who were part of a team that just a few months earlier had launched the Novartis digital research and development platform data42.  

    Their main office, located in an old lab building on the Campus in Basel, was near-empty and, for an outsider, looked a bit forlorn. But the long rows of desks, each equipped with large computer screens connected to the data centers of Novartis around the world, spoke of high ambition and big things to come.  

    2019 was a peak year for digital exploration. New digital tools and the promise of artificial intelligence saw the pharma sector rush into a digital El Dorado. The hope was to transform conventional drug discovery and reduce the painfully long and costly drug development times with the help of powerful algorithms and superfast computers.   

    The general idea was as follows: data42 should bring the massive, yet virtually untapped clinical and scientific data of Novartis, which encompassed nearly 2 million patient years of data, to life and help the company find new scientific insights to develop drugs faster.

    But despite this lofty vision, the data42 team was aware that the task ahead would be fraught with major challenges, and the team was sure that – over time – they would meet with growing resistance if data42 did not live up to the hype that was raging around them.  

    Focus on the doable

    They tried to keep a cool head, shied away from overpromising and focused on what is doable. Even the name data42 – a reference to Douglas Adams’ novel The Hitchhiker’s Guide to the Galaxy – was a reference to humility as it stressed that computer power can only be unlocked with the right – human – question.

    The data42 team, which quickly grew to some 150 members, set themselves short-term targets to produce palpable results within a manageable timeframe. This strategy was instrumental in moving the project forward because the massive amount of data, which spanned thousands of clinical trials and millions of data points from years of research, would have been too heavy to lift in one go.

    But even as the team took a step-by-step approach, the task was gargantuan. Anyone who has tried to combine Excel sheets using different formats may have at least a rough idea what a painstaking task the team faced.

    In the case of data42, millions of data points had to be cleaned. One example, for instance, was to find a common denominator for a patient’s gender for the thousands of clinical trials conducted by Novartis in the past.

    Some trials would mark a patient’s gender with an “f” or an “m,” in others “masculine” or “feminine” would be used, others again used the terms “male” and “female.” To make the data machine-readable, this needed to be fixed first.

    The same problem would arise in other instances, be it a patient’s race, their medical history or the actual clinical data, which could differ widely depending on where the trial was conducted. In short, the team had to clear up the mess.

    “All in all, we knew that the beginning would be very hard as the initial lift was to clean the data and then produce results as fast as we could in order to win the trust of our partners and prove that we can leverage our pool of medical, clinical and research data,” says Plueckebaum recalling the early days of data42.

    content-image
    Enter fullscreen

    Pascal Bouquet, Gabriel Eichler, Sam Khalil, Christian Diehl and Achim Plueckebaum.

    Map of Life

    One of the first projects was in the field dermatology, where the team was looking into existing clinical trial data to find correlations with other disease areas that could help drug development teams find potential new indications for existing medicines.

    While the search failed to deliver the hoped-for data nuggets, the team learned enough to push the project forward, even though the initial setback compelled the team to adjust its initial strategy.

    However, as word spread that Novartis was accelerating its data efforts, a host of digital experts and external partners joined the team, helping data42 move forward fast and overcoming initial challenges.

    An early joiner to the team was Gabriel Eichler, Head of Data at data42. For him, the promise of data science is that it can unlock patterns that until recently have been completely overlooked by science.

    “In many ways, what we are doing to data42 is similar to what the Venetian glassblowers did when they invented clear glass: This gave rise to telescopes and microscopes, which would completely change the course of science and history,” Eichler explained. “It’s like the groundwork for things to come, which we have no idea about, like the internet or e-mail that gave rise to completely new industries.”

    While cleaning up the data was one thing, combining the different data sets together was another huge challenge. “But,” Eichler said, “only through this combination can data really live up to its promise. The data pool that we were able to establish could allow scientists to learn ever more about the interconnectedness between biology and diseases.”

    With this vision in mind, the team built the first core product of data42 that is now called the Map of Life in less than two years. While the time frame sounds short, the process was anything but smooth. It included a series of strategic changes, major personal efforts of some of the key team players as well as technical support from external partners.

    But the heavy lift was worth the effort: The system now not only includes data from nearly 3000 clinical trials, but also encompasses research and genomics data, including 1.6 trillion genomic variants, as well as blood and tissue samples and other medical data, helping scientists deepen their understanding of diseases and find correlations between data that were so far unknown.

    Synthetic trials

    The advantage of this centralized map can hardly be appreciated without knowing how clinical data has been handled for decades.

    “Normally, if a scientist wanted access to certain clinical data in the past, this would entail not only a long search to find the right person owning the data, but also discussions about how the data would be used. This process alone could easily take weeks if not months,” says Sam Khalil, Head of Science at data42.

    Breaking down these structural silos with the help of the Map of Life, scientists can now access this data in the blink of an eye, also because the data42 team made sure that all patient data is fully protected.

    As Sam Khalil walks me through the platform’s functionalities, the efficiency gains are immediately evident. If one wants to know how many male and female patients participated in clinical trials, the result is ready within a fraction of a second. Just two years ago, such an effort would be worth a doctoral thesis …

    “Trial data – including genomics and proteomic data – can now be analyzed across the entire clinical trial universe of Novartis and give insights that otherwise would have been difficult if not impossible to collect in the past,” Khalil explained. “But the real gain is that now all of the clinical data can be interlinked in ways unthinkable before, which will allow us to even run synthetic trials in the near future.”

    Normally, in order to find out whether a certain medicine can be used in a particular indication, drug development teams have almost no other options than to run actual clinical trials. Now, this, in future, could be partly done with the help of data42, which can provide important insights into trial feasibility.

    “In one instance, colleagues who were running a trial ran into difficulties recruiting patients and asked us to investigate the reasons for this,” Khalil recalled. “After we ran our analysis, we were able to show that the patient population in the trial setup was too narrow. This helped the team to refocus the trial, which helped them make the right decisions.”

    content-image
    Enter fullscreen

    Pascal Bouquet

    Con­tent users

    So far, some 250 early users across Novartis have tapped into data42’s Map of Life. Two of them are scientists from the Novartis Institutes for BioMedical Research, Michael Beste and Jason Laramie, who expect huge efficiency gains from the system.

    “I’ve used the system several times already and I am still amazed at what has been achieved in such a relatively short time,” Laramie says. “In the past, it took weeks, even months to organize and analyze certain data sets. Now, this can be done within a few days, even hours.”

    While Laramie expects data42 to gather pace over time and credits the fast setup of the system both to the technical skills of his colleagues and to the determination of Novartis to fund this cross-divisional unit, his colleague Michael Beste envisions that researchers “will have more shots on goal in quick succession … and will be able to broaden the hypothesis spectrum.”

    Colleagues from Global Drug Development are equally enthusiastic about the Map of Life. Janice Branson expects that with the new system “silo thinking” will be overcome and that the system, which today already hosts a massive data set, including MRI scans, X-rays and other data, will become even more powerful with the incorporation of real-world data.

    “Today, data42 is already helping to tear down the silos,” Branson said. “But I expect it to get more efficient with time once we learn how our medicines perform in real-world settings.” Hence, one of the next big things the data42 team is working on is to fully integrate and connect real-world data with clinical data from Novartis.”

    Work in progress

    The data42 journey so far has been a great achievement, even if there were periods with setbacks and strategic turns. Plueckebaum, Speyer and Bouquet, who experienced the journey from day one, say the team’s ability to come so far was down to its flexibility to pivot when required, listen to the needs of scientists and take bold action when necessary. Likewise, they are convinced, leadership support and a clear-cut strategic vision, especially in times of crisis, was and is equally important.

    While data and digital are deeply ingrained in the strategy of Novartis, one of the team’s strongest supporters has not only a long-term view on data and digital but is putting data42’s efforts in a wider medical context. “If you look at the history of medicine, this is a fairly young science that only started some 100 years ago,” Badhri Srinivasan, Head Global Development Operations and Executive Sponsor at data42, explained. “If you compare it to other fields such as physics, which has been around for millennia, we are only at the beginning.”

    In order to really push medicine forward, data and digital will be instrumental because of their ability to connect different medical fields. “Many medical breakthroughs in the history of medicine have been achieved in isolation,” Srinivasan specified. “Connecting the dots has been hard so far. But we all know that medicine can only be improved if we have a holistic understanding of our body and the underlying biology. This is really where data is needed and will be essential going forward.”

    Against this backdrop, data and digital will become ever more important, also because data42 is another major step toward creating a patient-centric organization that aspires to create truly personalized medicines. “Against this larger backdrop, data42 will continue to grow conceptionally because there is so much data out there that needs to be connected,” Srinivasan said. “This may be a hard lift. But, as our first cases show, a worthwhile and important road to take as it opens up so many possibilities, including our efforts to create truly personalized medicines that take individual patient needs into account.”

    icon

    Home
    en
    de
    zh
    jp
    Share
    Share icon