
The Columbia Center of AI Technology (CAIT) in collaboration with Amazon was founded in 2020 with a mission to better society through the development and adoption of advanced AI technologies contributing to a more secure, connected, creative, sustainable, healthy, and equitable humanity.
Â
In keeping with CAIT's mission, our 2023 CAIT Symposium envisions an AI-enabled partnership between patients and clinicians. The event will explore the gaps between current capabilities and needs, with the goal of identifying and prioritizing research opportunities to address these challenges.
Â
Affiliates of Columbia, Amazon, and other Amazon Science Hubs are invited to join us at Columbia for this event.
Join us for breakfast and registration at 9:00am, followed at 9:30am by presentations from leading scholars and practitioners in healthcare and AI set the stage for further discussion and collaboration throughout the day.
Â
Speakers: David Sontag (MIT); Sunita Mishra (Amazon Health Services); Carri Chan (Columbia Business School)
Join leaders from Columbia and Amazon for a panel discussion.
Speakers: Sunita Mishra (Chief Medical Officer, Amazon Health Services), Katrina Armstrong (Dean, Vagelos College of Physicians and Surgeons, Columbia Irving Medical Center), Andrew Diamond (Chief Medical Officer, One Medical)
Moderator: Shih-Fu Chang (Columbia Engineering)
Grab lunch and join one of four parallel breakout discussions, moderated by faculty and industry leaders in AI and healthcare.
Breakout 1 | Interaction: How might we create interactive intelligent systems for re-imagined patient-clinician encounters?
Location: Geffen 440 (4th Floor)
Moderated by Lena Mamykina (CUIMC) and Sunita Mishra (Amazon Health Services)
Â
Breakout 2 | Implementation: How might we create better ways to study AI interventions in healthcare?
Location: Geffen 420 (4th Floor)
Moderated by Sarah Rossetti (CUIMC) and Charles Kim (Amazon Health Services)
Â
Breakout 3 | Generative AI: How might we create generative AI models that are useful in healthcare settings?
Location: Geffen 590 (5th Floor)
Moderated by Shalmali Joshi (CUIMC) and Alex Woody (Amazon Health Services)
Â
Breakout 4 | Trustworthy AI : How might we enable trustworthy and responsible AI in healthcare?
Location: Geffen 570 (5th Floor)
Moderated by Gamze Gursoy (CUIMC) and Anna Rumshisky (Amazon Alexa/UMass Lowell)
David Sontag is a Professor in the Department of Electrical Engineering and Computer Science (EECS) and von Helmholtz Professor in the Institute for Medical Engineering and Science (IMES). He is also a principal investigator in the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT. Dr. Sontag's research interests are in machine learning and artificial intelligence. As part of IMES, he leads a research group that aims to transform healthcare through the use of machine learning. Dr. Sontag joined MIT in 2017 from New York University, where he was Assistant Professor in Computer Science and Data Science from 2011 to 2016, and before this he was a postdoctoral researcher at Microsoft Research New England from 2010 to 2011.
Douglas Weibel joined Amazon in 2014 at the inception of the Grand Challenge organization and became its General Manager in 2022. He leads teams focused on building net new businesses for Amazon. He has worked on a wide range of impactful projects spanning global health (protecting babies, moms), global agriculture (food security), infectious disease detection and management, co-founding startups in human fertility and veterinary diagnostics (one of which was acquired by a Fortune 100 company), and advising corporate, government, and defense leaderships on a wide variety of topics (e.g., COVID strategies and science for Jeff Bezos and Amazon’s Senior Leadership Team, and President Biden’s transition team). Along the way, he has developed a deep interest in areas spanning food security, energy/clean-tech, sustainable (‘green’) manufacturing, applications of compute and statistical methods, biotechnology, and healthcare. Douglas holds a PhD in chemistry from Cornell, and a postdoctoral fellow in chemistry and engineering at Harvard. From 2006-2018 he was DuPont Professor in the Departments of Biochemistry, Chemistry, and Biomedical Engineering at the University of Wisconsin-Madison.
Dr. Sunita Mishra is the Chief Medical Officer for Amazon Health Services, which aims to make it dramatically easier to find, choose, afford, and engage with the services, products and professionals needed to get and stay healthy through offerings like Amazon Pharmacy, Amazon Clinic, and One Medical. Throughout her career, Dr. Mishra has been focused on developing models of care that create a sustainable clinician experience while delivering on a more high-quality, human experience for patients. She’s an internist and has practiced as a primary care physician for over 20 years in both the U.S. and Singapore. Prior to her current role as Chief Medical Officer for Amazon Health Services, Dr. Mishra scaled and ran the medical group for Amazon Care. She joined Amazon after spending eight years at Providence, where she served in several roles—including Vice President for Consumer Innovation and Chief Executive of Express Care. Dr. Mishra received her BA and MD from the University of Arizona, her clinical training from the University of Washington, and her MBA from The Wharton School at the University of Pennsylvania.
Carri Chan teaches the core MBA class, Operations Management. Her primary research interests are in data-driven modeling of complex stochastic systems, dynamic optimization, and queueing with applications in health-care operations management. Her current focus is on combining empirical approaches with mathematical modeling to develop evidence-based approaches to improving patient flow through hospitals, and particularly intensive care units.
Dean, Fu Foundation School of Engineering and Applied Science and Morris A. and Alma Schapiro Professor; Professor of Electrical Engineering and Computer Science
Panel Moderator
Shih-Fu Chang is dean of Columbia Engineering, where he leads the education, research, and innovation mission. At Columbia Engineering, Dean Chang is responsible for developing cross-disciplinary initiatives and industry collaborations; recruiting top faculty and students; and leading the development and implementation of the school’s strategy for diversity, equity, and inclusion. He has greatly contributed to the growth and advancement of the School, propelling it to be one of the top engineering programs in the nation. He plays a key role in the creation of the School’s guiding vision, Columbia Engineering for Humanity.
 Andrew Diamond serves as One Medical’s Chief Medical Officer and is a practicing primary care physician. Since joining One Medical in the company's infancy in 2007, he has been a driving force in developing the practice’s philosophy and approach to care, designing its clinical systems, and maintaining its human-centered culture. Dr. Diamond has served as the company’s Chief Medical Officer since 2019, ensuring its clinicians have joyful and fulfilling careers and deliver the highest-quality care and service to One Medical members. Dr. Diamond holds a BS in biological sciences from Stanford University, and an MD as well as a PhD in Microbiology and Immunology from the University of Colorado. He completed his medical training at Stanford and is certified with the American Board of Internal Medicine.
Dean of the Vagelos College of Physicians and Surgeons, Executive Vice President for Health and Biomedical Sciences, Columbia University
Panelist
Katrina Armstrong, MD, leads Columbia University’s medical campus as the Chief Executive Officer of CUIMC, which includes the Vagelos College of Physicians and Surgeons (VP&S), the School of Nursing, the College of Dental Medicine, and the Mailman School of Public Health. She also is Executive Vice President for Health and Biomedical Sciences for Columbia University and Dean of the Faculties of Health Sciences and the Vagelos College of Physicians and Surgeons. As VP&S dean, Dr. Armstrong leads the nation’s second oldest medical school and the first to award an MD degree. She is an internationally recognized investigator in medical decision making, quality of care, and cancer prevention and outcomes, an award-winning teacher, and a practicing primary care physician.
Mamykina’s broad research interests include an individual’s sense-making and problem-solving in context of health management, collective sense-making within online health support communities, clinical reasoning and decision-making, communication and coordination of work in clinical teams, and ways to support these practices with informatics interventions. She also focuses on analysis of health information technologies and how they are used among critical care teams, as well as social computing platforms for facilitating knowledge sharing within clinical communities, and within online health support groups.
Dr. Rossetti research is focused on identifying and intervening on system-level weaknesses – particularly those related to poor communication and care coordination – that increase patient risk for harm within our healthcare system by applying computation tools to mine and extract value from electronic health record (EHR) data and leveraging user-centered design of patient-centered and collaborative decision support tools. Dr. Rossetti is an experienced critical care nurse and holds a PhD in Nursing Informatics from Columbia University School of Nursing. She was a National Library of Medicine Post-Doctoral Research Fellow at Columbia University’s Department of Biomedical Informatics.
Shalmali Joshi is an Assistant Professor of Biomedical Informatics at Columbia University. Previously, she was a Postdoctoral Fellow at the Center for Research on Computation and Society at Harvard University, as well as a Postdoctoral Fellow at the Vector Institute. She received her Ph.D. from the University of Texas at Austin (UT Austin). Her research is on the algorithmic safety of Machine Learning for human-centered domains. Shalmali has contributed to the field of explainability, robustness, and novel algorithms for ML safety with an emphasis on practical generative settings and impact on decision-making. Shalmali has published in ML and inter-disciplinary venues in healthcare such as NeurIPS, FAccT, CHIL, MLHC, PMLR, and perspectives in JAMIA, LDH, and Nature Medicine. She has co-founded the Fair ML for Health NeurIPS workshop, General Chair for ML4H 2022, and Program Chair for MLHC 2022.
Charles Kim is the product lead for AI at Amazon Health Services, and develops AI models and applications that enable high-quality, human-centric experiences at Amazon Pharmacy, Amazon Clinic, and One Medical. His mission is to leverage AI to make what should be easy, easy for both customers and clinicians, freeing up valuable time for deeper, more meaningful human interactions in care delivery. Since joining Amazon Health, he has been a driving force in innovating new technologies such as generative AI, to make it easier to find, choose, afford, and engage with personalized care. Before his current role, he led digital and physical product transformations across various industries for Amazon, including healthcare, retail, industrials, consumer electronics, and supply chain. Charles holds a BS in Industrial and Operations Engineering from the University of Michigan.
Alex is a Principal Engineer in Amazon Health Services. As a Senior Manager he led the organization that built and owned the tech behind Amazon Care’s clinic and business operations. This included Software Engineering, Clinical Informatics, Data Science, Business Intelligence and IT. This team included roughly 70 people and managed Care’s clinician portal, EHR interfaces, Data Warehouse, Patient-facing triage tools and administration of clinician laptops. Previously he worked on knowledge graphs, record linking, ontology development and entity resolution in the Alexa organization. He was a founding member of the team that built Alexa’s central knowledge ingestion and conflation system. Today he focuses on Privacy, Data Governance and GenAI. Alex holds an MS in Computer Science, and twin BAs in Anthropology and Spanish.
 Anna Rumshisky is an Associate Professor of Computer Science at the University of Massachusetts Lowell, where she leads the Text Machine Lab for natural language processing. Her primary research area is deep learning for natural language processing, with a focus on representation learning and model interpretability. Since 2020, she has been a Visiting Academic at Amazon Alexa AI. She was previously a postdoctoral fellow at MIT CSAIL and received a PhD in Computer Science from Brandeis University in 2009. She is a recipient of the NSF CAREER award in 2017 and the best thematic paper award at NAACL-HLT 2019. Her research has been funded by the NSF, NIH, Army Research Office, among others.
Gamze Gürsoy is an Assistant Professor at the Department of Biomedical Informatics at Columbia University and a Core Member at the New York Genome Center. She is also affiliated with the Department of Computer Science. Her research group develops privacy-preserving tools to analyze and understand large-scale omics data in relation to diseases and phenotypes with a particular interest in developing software, file formats, and pipelines that enable broad sharing and analysis of sensitive genotypic and phenotypic data in public servers. Her lab also develops tools to map the regulatory wiring of the chromatin to interrogate the effect of three-dimensional organization of genomes on molecular phenotypes. The computational work in the lab is supported by experimental approaches, creating opportunities for trainees in cross-disciplinary studies.
Getting Patients and Clinicians the Information they Need, when they Need it: The large amount of time clinicians spend sifting through patient notes and documenting in electronic health records (EHRs) is a leading cause of clinician burnout. By proactively and dynamically retrieving relevant patient information, we can reduce the effort required to find relevant patient history. First, we show how to use EHR audit logs for machine learning as a source of supervision of note relevance in a specific clinical context, at a particular point in time. We describe our evaluation in two clinical settings, the emergency department and oncology. Second, we show how large language models can be used to accurately extract information from clinical notes, and talk about their potential to help patients better understand their clinical notes. Finally, we show how to use machine learning to better organize and highlight relevant information in a patient's medical records, both by creating problem-oriented views and by automatically extracting action items for clinician and patient follow-up.
Long-form clinical summarization of hospital admissions has real-world significance because of its potential to help both clinicians and patients. The faithfulness of summaries is critical to their safe usage in clinical settings. To better understand the limitations of abstractive systems, as well as the suitability of existing evaluation metrics, we benchmark faithfulness metrics against fine-grained human annotations for model-generated summaries of a patient's Brief Hospital Course. We create a corpus of patient hospital admissions and summaries for a cohort of HIV patients. Annotators are asked to categorize manually highlighted summary elements into one of three categories: ``Incorrect,'' ``Missing,'' and ``Not in Notes.'' We meta-evaluate a broad set of faithfulness metrics and, across metrics, explore the importance of domain adaptation (e.g. the impact of in-domain pre-training and metric fine-tuning), the use of source-summary alignments, and the effects of distilling a single metric from an ensemble of pre-existing metrics. Off-the-shelf metrics with no exposure to clinical text correlate well yet overly rely on summary extractiveness. As a practical guide to long-form clinical narrative summarization, we find that most metrics correlate best to human judgments when provided with one summary sentence at a time and a minimal set of relevant source context.
Classifying Neuropathologically Confirmed Alzheimer’s Disease and Lewy Body Dementia on Clinical Neuroimaging with Deep Learning
AD and DLB co-occur frequently, but post-mortem pathology reveals limited ante-mortem accuracy in the clinical diagnosis of AD comorbid with DLB (AD/DLB) against AD alone. We tested whether a single T1-weighted (T1) MRI scan may differentiate AD/DLB and AD using heterogeneously acquired (both research and clinically acquired) neuroimaging, to maximize the amount of pathology-confirmed in-vivo MRIs. T1 MRI scans are limited to those participants with neuropathology. We examine two groups, an AD with and without LBD pathology. Slice-level 2D Convolutional neural networks (CNN) are trained to differentiate the two groups. From each scan, gray matter is segmented, normalized, smoothed and thresholded and used as input. On the subject-level, CNN records a classification accuracy of 82% and f1 score of 0.79. Prediction accuracy was higher when the scan date is closer to DOD, with a 100% accuracy recorded within a year from the date-of-death. This study demonstrates how machine learning approaches can help harmonize neuroimaging data from clinical sources to better understand disease diagnosis and progression using a true gold-standard of neuropathological confirmation. The frameworks utilized here can be extended to other diseases that are frequently co-occurring and feasibly extend to single scan diagnostic clinical utility of scans.
Leveraging Ophthalmologist Eye Gaze Data for Self-Supervised Training of a Glaucoma Detection Transformer Model
One of the biggest challenges in artificial intelligence for healthcare lies in the acquisition of large, accurately labeled datasets essential for deep learning (DL) model training. The scarcity of such labeled data hinders the development of generalizable models, trained on sufficiently-large datasets to perform well on unseen data. Additionally, disparities in expert opinions (for example for eye diseases like glaucoma, where there is disagreement even among clinicians on what constitutes disease) can impede establishing reliable ground truth for training. To address these issues, we use multimodal inputs: optical coherence tomography (OCT) reports and eye tracking of expert ophthalmologists as they viewed these OCT reports for glaucoma diagnosis, to enable the robust training of DL models via self-supervision derived from clinician eye movements. Eye tracking data offers a wealth of information regarding the focus of attention and the expertise level of individuals examining medical reports. In this study, we created a region-based eye movement encoding and trained a transformer model via contrastive triplet loss to learn to separate OCT reports classified as healthy from OCT reports classified as glaucoma. This approach leveraging eye tracking ‘pseudo-labels’ has the potential to enhance the performance DL models for disease diagnosis with few explicit labels.
The healthcare and life sciences sector accounts for over 30% of global data generation. Yet, approximately 97% of them remains untapped due to storage across diverse formats, as highlighted by the Deloitte report. The primary challenge is cleaning and integrating the data. To tackle this, we introduced AutoTransform, a system utilizing LLM to automatically assess, clean and transform the data. AutoTransform provides a user-friendly interface, allowing users to easily comprehend, verify, and oversee the transformation process. Once cleaned, the data can be efficiently utilized in subsequent LLM-based pipelines for tasks such as data integration and analysis.
Glaucoma Progression Detection and Humphrey Visual Field Prediction Using Discriminative and Generative Vision Transformers
"Glaucoma is one of the top causes of blindness worldwide. Assessing its progression is critical to determine potential visual impairment and to design sound treatment plans. Standard automated perimetry tests, commonly known as visual field (VF) tests, are clinically used to evaluate the state of functional vision. To provide an accurate and automatic diagnostic tool for clinical decision making in glaucoma progression, we utilize the predictive power of artificial intelligence (AI) and propose two vision transformer (ViT)-based deep learning (DL) networks.
First, we optimize a spatiotemporal ViT to classify a subject’s rate of glaucoma progression (GP) using only 3 baseline VFs; we explore threshold mean deviation (MD) rate of change from -0.3 to -1.5 dB/year and achieve up to 89% GP detection accuracy. Second, we develop a VF-to-VF generation architecture via a diffusion model with a ViT backbone. The model predicts future VFs with Pointwise Mean Absolute Error (PMAE) as low as 2.15 dB for mild VF deficits and is the first to extend VF prediction up to 10 years into the future. Our models are trained and validated on our ‘62K+’ dataset, the largest available of VFs to-date including at-risk, minority populations, thus ensuring our models’ generalizability. We establish our computational methods and compare testing results on the publicly available UWHVF dataset. In short, our study utilizes novel AI methods for predicting future rates and patterns of glaucoma progression in order to expedite timely treatment for better patient quality of life. The code is available at https://github.com/AI4VSLab/GP-Detection-VF-Prediction.
Automatic Lesion Size Assessment Method for Multifrequency Single Transducer Harmonic Motion Imaging Using Convolutional Neural Network
Medical imaging-derived tumor or lesion size quantification provides clinically relevant information for diagnosis and treatment monitoring. Multi-frequency single transducer harmonic motion imaging (ST-HMI) is an acoustic radiation force (ARF) based ultrasound elastography method that interrogates tissue mechanical properties by transmitting a multi-frequency ARF excitation pulse and estimating the induced oscillatory displacements at multiple frequencies simultaneously. In this study, an automated lesion size assessment method is presented using the U2Net model. The training-validation set contains 1094 multi-frequency ST-HMI images of heterogeneous phantoms with the inclusion’s Young’s moduli ranging from 6-70 kPa and diameters ranging from 1.7-10.5 mm. Dice score, sensitivity, specificity, and area estimation error were used to evaluate the performance of the model in the test phantom data set, in vivo 4T1 breast cancer mouse tumors, and ex vivo focused ultrasound (FUS)- induced thermal lesions. The average ± standard deviation of (Dice score, sensitivity, and specificity) was (0.92±0.04, 0.92±0.05, 0.99±0.01), and (0.87±0.07, 0.96±0.04, 0.95±0.04) in the phantoms and mouse tumors, respectively. The average absolute error in the model-predicted area of FUS-induced lesion was 9.45±0.77%. These results demonstrated the potential of multi-frequency ST-HMI to determine tumor size change in treatment monitoring and lesion size in ablation therapy using deep learning.
The 2023 CAIT Symposium will be adhering to Columbia Business School's COVID-19 precautions, as follows:
Vaccination: The COVID-19 vaccine mandate will no longer be in effect, and proof of vaccination does not need to be submitted. However, we strongly recommend following CDC guidelines for COVID-19 vaccination which recommend that everyone be up to date with COVID-19 vaccines, including booster doses.
Masking: Following community guidance on indoor masking is recommended, which is predicated on community transmission levels. Individuals may decide to mask based on their own circumstances and their preferences. Such decisions should be respected. Masks will be available at the check-in desk for those who would like them.
Positive COVID-19 tests: Positive test results do not need to be reported to the University; please refer to the CDC for guidance after a positive test and consult with your provider for appropriate care. The University continues to strongly recommend that participants stay home if they feel unwell and isolate according to CDC guidelines if they test positive for COVID-19. Â
We will continue to monitor all CDC guidelines and the protocols and requirements may be adjusted if conditions change. Please review this page and the Frequently Asked Questions (FAQ) before your arrival on campus so you have the most current information.
An introductory paragraph set in a slightly larger size can help provide a rhythm to the typography and help increase the legibility of the page.
In case of other emergencies or incidents on the day of the event, listen for instructions over the public address system. To report an emergency, call the Columbia Campus Emergency Number - 212-853-3333. Non-emergency incidents can be reported at 212-853-3301.
The mission of the Columbia University Center of Artificial Intelligence Technology in collaboration with Amazon is to better society through the development and adoption of advanced AI technology contributing to a more secure, connected, creative, sustainable, healthy and equitable humanity.