Cambridge Healthtech Institute’s 2nd Annual

AI/ML-Enabled Drug Discovery – Part 1

AI/ML for Identifying Novel Targets and Pathways

September 26 - 27, 2023 EDT

Cambridge Healthtech Institute’s two-part conference on Artificial Intelligence (AI)/Machine Learning (ML)-Enabled Drug Discovery will highlight the increasing use of computational tools, AI modeling, algorithms, and data science for identifying novel drug targets, drug design, virtual screening, lead optimization, and ADME/toxicology assessments. Relevant case studies and research findings will show how and where AI/ML can be successfully integrated and implemented in drug discovery. It will bring together chemists, biologists, pharmacologists, and bioinformaticians to talk about hope versus hype, and to understand the caveats of AI-enabled decision-making. Part 1 of the AI/ML-Enabled Drug Discovery conference will focus on use of AI and ML predictions and modeling for identifying and prioritizing drug targets and cellular pathways to pursue.

Tuesday, September 26

Registration and Morning Coffee7:00 am

Welcome Remarks7:55 am

IMPACT OF AI: REAL-LIFE CASE STUDIES

8:00 am

Chairperson's Remarks

Shruthi Bharadwaj, PhD, Global Lead, Digital & Analytics, R&D Global Operations, Sanofi

8:05 am

Data and Advanced Analytics Driving Innovation Within Pharma

Shruthi Bharadwaj, PhD, Global Lead, Digital & Analytics, R&D Global Operations, Sanofi

Implementation of AI-enabled specialized search engines for scientists is imperative to enhancing user experience. The search engines, powered by natural language processing (NLP) algorithms to understand and interpret scientific language, enables researchers to find relevant information quickly and easily. Using ML techniques, the search engines learn and refine results over time, making them more accurate and efficient. It identifies patterns and relationships in scientific data leading to new discoveries and breakthroughs.

8:35 am

Using AI/ML to Close the Drug Discovery Triangle

Michael Liebman, PhD, Managing Director, IPQ Analytics, LLC

Drug discovery involves three legs of the triangle: Disease, Target, and Drug. Significant efforts focus on identifying targets (network and pathway-based analysis) and in identifying drug candidates (optimizing the drug-protein interaction). Drug researchers rarely deal with current diagnostic limitations, complex disease etiologies, or disease heterogeneity. Novel methods for disease stratification, e.g., knowledge graphs and quantum computing, redefine the disease process; examples from MS, TNBC, and rare diseases will be presented.

9:05 am The Use of Human-Centric AI in Target Identification and Validation

Ramon Perez, Ramon D. Perez, PhD, Scientific Liaison, Scientific Affairs, Causaly

Target identification and validation is heavily reliant on the current scientific literature to understand the disease pathophysiology and target biology. Manually extracting information from publications not only requires tremendous human effort, but also introduces bias in the process. Causaly provides an AI-based solution to support scientists in identifying and qualifying targets. Our technology mines the scientific data and rapidly delivers search results with high precision, solves the knowledge problem in target ID and validation, and breaks ground in human-centric AI to support target research.

Networking Coffee Break9:35 am

10:05 am

Quality Assessment of AI Tools in Image Informatics in Medicine (Case studies in Radiology & Pathology)

Arvind Rao, PhD, Associate Professor, Department of Computational Medicine and Bioinformatics, University of Michigan

Given the wide variety of AI tools currently being deployed in radiology and pathology, it is feasible to bring AI to clinical care. However, the calibration of these tools (i.e, the data, the models and inference) for cost-sensitive situations like healthcare is an important point. Using the example of tumor segmentation task, we will review a few algorithms and describe their potential modes of failure in addition to scoring rubrics related to data and model veracity. This talk will use case studies from these domains to discuss these aspects and their use for adoption of AI.

10:35 am

Multi-Omics and Machine Learning for Interpretable Selection of Candidate Drug Biomarkers and Mode-of-Action Studies

Andrew Jarnuczak, PhD, Associate Principal Scientist, Proteomics, AstraZeneca

Along with target identification, understanding mechanism of action and identification of pharmacodynamic biomarkers are central to early drug discovery. The right information and its intelligent application allow bridging preclinical and clinical development. Here, I will describe how we apply statistics and decision tree-based machine learning to proteomics, transcriptomics, and metabolomics experiments to accelerate those efforts. Through real-life case studies, I will showcase the transformation in data analysis methods that took place in recent years.

11:05 am

Identifying Polyploid Cells in Tissue Images via Instance-Aware Semantic Segmentation

Courtney Rouse, PhD, Research Engineer, Artificial Intelligence, Southwest Research Institute

Cancerous polyploid cells become resistant to chemotherapy and escape cell death. Doctors at University of Texas Health - San Antonio are developing drugs to be administered along with chemotherapy that are effective at reducing the cancerous polyploid cells. Deep learning is used to identify polyploid cells pre- and post-administration of drug candidates to evaluate their effectiveness.

11:35 am Driving Scientific Certainty - Integrating Active AI Into Drug Discovery

Brian Albrecht, Head of Drug Discovery, Related Sciences

Emilio Cordova, Executive Director, Logica, Charles River

Kristopher King, Senior Director, Portfolio & Program Management, Valo Health

Join us in this panel discussion as we gain opinions from drug discovery and AI technology leadership on the evolution of integrated AI driven drug discovery. Where we see its greatest impact as an active learning model across early drug discovery, its potential failure without cerebral guidance and cultural adoption, and the inevitability of its use as we seek to navigate the far reaches of chemical space and deliver greater numbers of therapeutics in less time.  

Transition to Lunch12:05 pm

12:10 pm LUNCHEON PRESENTATION:Accelerate Drug Discovery Using AI, Physics-Based Method, and Automated Synthesis

Sung Eun Jee, PhD, Application Scientist, Medicinal Chemistry, XtalPi, Inc.

The DMTA cycle is the centerpiece of preclinical drug discovery. Medicinal chemists improve compounds from initial hits to drug-like developmental candidates. XtalPi’s mission is to accelerate discovery by reducing iterations. We access large chemical spaces with generative AI algorithms to increase the number of design ideas. Selected molecules are assessed by more accurate physics-based methods before they are executed by our automated synthesis platform.

 

Session Break12:40 pm

COMPUTATIONAL TOOLS FOR PROTAC DESIGN

1:15 pm

Chairperson's Remarks

Michael Liebman, PhD, Managing Director, IPQ Analytics, LLC

1:20 pm

PROTAC Rational Design: A Long Way to Go

Evianne Rovers, Graduate Student, Department of Pharmacology and Toxicology, University of Toronto, Structural Genomics Consortium

We benchmarked the ability of commercial and open-source PROTAC virtual screening tools to 1) predict the ternary complexes observed in crystal structures and 2) dissociate active from inactive PROTACs. These tools are better able to predict near-native ternary complex structures than traditional protein–protein complex prediction software, but PROTAC virtual screening efficiency is unclear and highly variable. More experimental data is necessary for further improvement.

1:50 pm

High Accuracy Prediction of PROTAC Complex Structures

Dima Kozakov, PhD, Associate Professor, Applied Mathematics & Statistics, SUNY Stony Brook

We present a method for generating high accuracy structural models of E3 ligase-PROTAC-target protein ternary complexes and of the full degradation assembly. The method is dependent on two computational innovations: adding a “silent” convolution term to an efficient protein–protein docking program to eliminate protein poses that do not have acceptable linker conformations and clustering models of multiple PROTACs targeting the same target. We validate the approach on known systems, as well as blindly on new PROTACs.

2:20 pm Harmonizing assay data with ontologies curated from ML models

Kelly Bachovchin, PhD, Customer Engagement Scientist, Support, Collaborative Drug Discovery

Collaborative Drug Discovery (CDD) provides a whole solution for today’s biological and chemical data needs, differentiated by ease-of-use and superior collaborative capabilities. An assay annotation technology has been implemented in CDD Vault to align the principles of FAIR data so researchers can create a robust ecosystem where data is readily discoverable and accessible and comes with rich contextual information that facilitates its interpretation and reuse.

2:35 pm From Unmanageable to Meaningful: How AI and Knowledge Graphs Are Helping to Alleviate Information Overload

Hiedi L. Nichols, Senior Client Success Manager, Research Solutions

AI and ML are essential for finding that critical needle in today’s data haystack. In addition to NLP, learn how ResoluteAI enriches heterogeneous data sources with proprietary tagging and medical taxonomies. We'll also explore the role that knowledge graphs have in integrating these data sources to reveal meaningful connections. 

In-Person Group Discussions2:50 pm

In-Person Group Discussions are informal, moderated discussions, allowing participants to exchange ideas and experiences and develop future collaborations around a focused topic. Each discussion will be led by a facilitator who keeps the discussion on track and the group engaged. To get the most out of this format, please come prepared to share examples from your work, be a part of a collective, problem-solving session, and participate in active idea sharing. Please visit the In-Person Group Discussions page on the conference website for a complete listing of topics and descriptions.

IN-PERSON GROUP DISCUSSION 8A:

How Successful Are AI/ML Approaches in Drug Development Today?

Shruthi Bharadwaj, PhD, Global Lead, Digital & Analytics, R&D Global Operations, Sanofi

Michael Liebman, PhD, Managing Director, IPQ Analytics, LLC

Alexander Taguchi, PhD, Director, Machine Learning, Antibody Discovery, iBio, Inc.

  • Growing use of natural language processing, ChatGPT, and other tools 
  • Case studies highlighting use of knowledge graphs and quantum computing in healthcare 
  • Applying statistics, decision tree–based ML to data analysis in drug discovery
  • Effective use of virtual screening and structure-activity predictions tools​
  • AI for driving peptide therapeutics

Grand Opening Refreshment Break in the Exhibit Hall with Poster Viewing3:35 pm

AI FOR PROTEIN STRUCTURE & DESIGN

4:15 pm

Utilizing AI for Solving Novel Structures: Cryo-EM Structure of the PAPP-A IGFBP5 Complex Reveals Mechanism of Substrate Recognition

Russell Judge, PhD, Principal Research Scientist II, Structural Biology, AbbVie

Pregnancy-Associated Plasma Protein A (PAPP-A) is a metalloprotease that regulates Insulin-like Growth Factor (IGF) bioavailability and signaling. Here we present the Cryo-EM structures of holo-PAPP-A and PAPP-A in complex with substrate Insulin-like Growth Factor Binding Protein 5 (IGFBP5), which in combination with biochemical experiments explain the mechanisms of substrate recognition and selectivity. The AI-developed AlphaFold structure predictions for PAPP-A and IGFBP5 were crucial for rapid structure determination in this study. Using the PAPP-A structures as a case study, the benefits and caveats of utilizing AI in novel protein structure solution will be discussed.

4:45 pm

Rapid Discovery of Antibodies That Bind Therapeutic Sites of Interest with Machine Learning-Engineered Immunogens

Alexander Taguchi, PhD, Director, Machine Learning, Antibody Discovery, iBio, Inc.

iBio has developed a machine learning technology for controlling the epitope binding site in antibody discovery. This is accomplished by computational design of peptides that embody the target epitope sequence and structure. These peptides are then used in immunization or in vitro screening to improve the efficiency of discovering antibodies that bind to the therapeutic site of interest.

5:15 pm

Target-Agnostic Generative AI for Peptide Drug Design

Nicholas Nystrom, PhD, CTO, Peptilogics, Inc.

Generative AI can enable drug discovery for known and novel targets alike, potentially overcoming otherwise prohibitive costs and timelines of early discovery in low-data regimes. Peptilogics’ Nautilus AI platform for peptide drug design combines proprietary generative and predictive AI algorithms for hit ID, hit-to-lead, and lead optimization. Nautilus has been applied in-house and through partnerships to diverse targets including a previously undrugged GPCR and therapeutic areas.

Welcome Reception in the Exhibit Hall with Poster Viewing5:45 pm

Close of Day6:45 pm

Wednesday, September 27

Registration and Morning Coffee7:30 am

GENERATIVE AI FOR DRUG DISCOVERY

7:55 am

Chairperson's Remarks

Tudor Oprea, MD, PhD, CSO, Expert Systems, Inc.

8:00 am

Data Science and Informatics Meet AI: A Journey of Drug Discovery

Tudor Oprea, MD, PhD, CSO, Expert Systems, Inc.

Data, information, and knowledge are as critical as algorithms in drug discovery. Combining these elements, we explore AI models and their utility in the drug-target-disease space. We discuss lessons learned along this multidisciplinary journey.

8:30 am

A ChatGPT for Drug Discovery: One Model Versus Many

Sean Ekins, PhD, Founder & CEO, Collaborations Pharmaceuticals, Inc.

Recently, large multi-task networks have been trained using transfer-learning, in which a related task is trained alongside the primary task(s), with the model then gaining predictive performance on the primary task. We have explored three large multi-task model architectures: graphSAGE, Conv-LS, and MolBART with a range of databases including ChEMBL. We will describe how these models can be used for drug discovery and generative de novo design of molecules. 

9:00 am

Linking Biology, Chemistry, and Medicine with Robotics and Generative Reinforcement Learning for Efficient Drug Discovery

Kyle Tretina, PhD, Principal Scientist and Alliance Manager of AI Platforms, Target Discovery, Insilico Medicine

AI is transforming many steps of drug discovery and drug development but most of the efforts in target discovery, chemistry, preclinical, clinical development are disconnected. Efficient design of fully connected AI workflows utilizing external and internal data sources and active learning using robotics-generated data allows for identification of paths of least resistance and with a high probability of success. This talk will present a connected generative AI pipeline achieving human-level validation.

9:30 am PANEL DISCUSSION:

New AI Frontiers and Their Impact on Drug Development

PANEL MODERATOR:

Tudor Oprea, MD, PhD, CSO, Expert Systems, Inc.

PANELISTS:

Sean Ekins, PhD, Founder & CEO, Collaborations Pharmaceuticals, Inc.

Kyle Tretina, PhD, Principal Scientist and Alliance Manager of AI Platforms, Target Discovery, Insilico Medicine

Coffee Break in the Exhibit Hall with Poster Viewing10:00 am

PLENARY KEYNOTE PROGRAM

10:40 am

Plenary Chairperson’s Remarks

An-Dinh Nguyen, Team Lead, Discovery on Target, Cambridge Healthtech Institute

10:45 am

PLENARY: The New Science of Therapeutics

Jay E. Bradner, MD, Physician Scientist, Former President, Novartis Institutes for BioMedical Research, Inc.

I will share reflections on how new paradigms in the science of therapeutics are creating opportunities to approach historic challenges in medicine. Specifically, I will share approaches to targeting transcription factors and discuss how modularity is a paradigm for next-generation low-molecular weight and biological therapeutics. Finally, I will offer reflections on drug development and the fitness, opportunities, and challenges of the biomedical ecosystem.

11:30 am

PLENARY: Accelerating Drug Discovery Using Machine Learning and Cell Painting Images

Anne E. Carpenter, PhD, Senior Director, Imaging Platform & Institute Scientist, Broad Institute

Shantanu Singh, PhD, Senior Group Leader, Machine Learning, Imaging Platform, Broad Institute

Microscopy images can reveal whether a cell is diseased, is responding to a drug treatment, or whether a pathway has been disrupted by a genetic mutation. In a strategy called image-based profiling, often using the Cell Painting assay, we extract hundreds of features of cells from images. Just like transcriptional profiling, the similarities and differences in the patterns of extracted features reveal connections among diseases, drugs, and genes.

Close of AI/ML-Enabled Drug Discovery – Part 1 Conference12:15 pm