Friday, March 29, 2019

Data Science, Biomedical Informatics, and the OHSU Department of Medical Informatics & Clinical Epidemiology

(The following is reposted from Health, Data, Information and Action, the blog of the Oregon Health & Science University Department of Medical Informatics & Clinical Epidemiology.)

Data Science is a broad field that intersects many other fields within and outside of biomedicine and health, including biomedical informatics. Data science is certainly an important component of research and educational programs in the OHSU Department of Medical Informatics & Clinical Epidemiology (DMICE).

What exactly is data science? There are many methods, but one consensus is, “the multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured” [1].

The definition of data science is somewhat different from the definition of biomedical informatics, which is “the interdisciplinary field that studies and pursues the effective uses of biomedical data, information, and knowledge for scientific inquiry, problem solving and decision making, motivated by efforts to improve human health” [2].

Clearly there is overlap as well as complementarity. As noted by Payne et al., biomedical informatics deals with a broader spectrum of data and information tasks, focused not only on what is learned from data but also how that is applied in a broader sociotechnical context [2].

Many DMICE research programs focus on aspects of Data Science:
  • Re-use of data from EHR (William Hersh, Aaron Cohen, Steven Bedrick) – leveraging data in EHR to identify patients as candidates for research studies and signals for rare diseases (porphyria) [3]
  • Documenting genomic variation in leukemia (Shannon McWeeney) – allowing for repurposing of drugs [4]
  • Quality of data for clinical care and research (Nicole Weiskopf) – methods for insuring completeness and comprehensiveness of data for use in research, quality measurement, and other tasks [5]
  • Urinary microbiome in health and disease (Lisa Karstens) – identifying role of microbiome and how its genetics can be leveraged for diagnosis and treatment [6]
  • Use of ambient data to detect and manage clinician strain – Dana Womack, Paul Gorman [7]
DMICE educational programs include Data Science in many of their courses. Our Bioinformatics & Computational Biomedicine (BCB) major includes:
  • Data Harmonization and Standards for Translational Research - BMI 533/633 (Instructors: Melissa Haendel, Ph.D., Ted Laderas, Ph.D., Christina Zheng, Ph.D.)
  • Management and Processing of Large Scale Data –BMI 535/635 (Instructors: Michael Mooney, Ph.D., Christina Zheng, Ph.D.)
  • Computational Genetics -BMI 559/659 (Instructor: Shannon McWeeney, Ph.D.)
  • Bioinformatics Programming and Scripting - BMI 565/656 (Instructor: Michael Mooney, Ph.D.)
  • Network Science and Biology- BMI 567/667 (Instructor: Guanming Wu, Ph.D.)
  • Data Analytics –BMI 569/669 (Instructors: Brian Sikora, Delilah Moore, Ted Laderas, Ph.D.)
Our Health & Clinical Informatics (HCIN) major includes:
  • Introduction to Biomedical and Health Informatics - BMI 510/610 (Instructor: William Hersh, M.D.)
  • Analytics for Healthcare - BMI 524/624 (Instructors: Abhijit Pandit, M.B.A., Tracy Edinger, N.D.)
  • Clinical Research Informatics- BMI 523/623 (Instructor: Nicole Weiskopf, Ph.D., Robert Schuff)
We also have developed ample instructional materials in Data Science for other learners:
References

1. Donoho, D (2017). 50 years of Data Science. Journal of Computational and Graphical Statistics. 26: 745-766. https://dl.dropboxusercontent.com/u/23421017/50YearsDataScience.pdf.
2. Payne, PRO, Bernstam, EV, et al. (2018). Biomedical informatics meets data science: current state and future directions for interaction. JAMIA Open. 1: 136-141. https://academic.oup.com/jamiaopen/article/1/2/136/5068667.
3. Wu, S, Liu, S, et al. (2017). Intra-institutional EHR collections for patient-level information retrieval. Journal of the American Society for Information Science & Technology. 68: 2636-2648.
4. Tyner, JW, Tognon, CE, et al. (2018). Functional genomic landscape of acute myeloid leukaemia. Nature. 562: 526-531.
4. Weiskopf, NG, Bakken, S, et al. (2017). A data quality assessment guideline for electronic health record data reuse. eGEMS. 5(1): 14.
6. Karstens, L, Asquith, M, et al. (2016). Does the urinary microbiome play a role in urgency urinary incontinence and its severity? Frontiers in Cellular and Infection Microbiology. 6:78. https://doi.org/10.3389/fcimb.2016.00078.
7. Womack, D. (2018). Subtle cues: Qualitative elicitation of signs of strain in the hospital workplace. PhD Dissertation, Oregon Health & Science University.

Friday, March 8, 2019

Have We Passed the High-Water Mark of Seamless Technology?

Have we passed the high point of technology being seamless and easy to use? I recently had to give a presentation to faculty that included showing a few slides, and I assumed I could just bring my MacBook Pro, with its HDMI port, and plug it into the projector that was in the conference room on my campus where the meeting was taking place.

It was not meant to be. Despite having the right cables, my computer would not sync to the projector. That led to the search for other options. Which then led us into dongle hell. Maybe we should try the VGA adapter also available in the room. But of course my newer-model Mac had USB-C and not the older standard Mac connector that is still prevalent, at least among the dongles.

We finally got it all to work, although not without a lot of wasted time. The episode also got me thinking and wondering if we have passed with high-water mark of technology working seamlessly. Are those golden days overs?

I used to feel like an old-timer when I would reminisce about how technology used to be so awful in the late 20th century. I am sure I got snickers from the younger crowd when I bemoaned the old days of having to connect to my email over telephone modems, and trying to figure out the right number of commas in the ATDT command set of telephone modems to time when the lines would ask for things like long-distance codes to be entered. I also remember the days of bringing overhead transparencies of presentations just in case my computer would not sync with the archaic projectors. I recall the days before USB drives when transferring files between computers was sometimes impossible, especially with the disappearance of floppy disks (and presentation file sizes exceeding their capacity).

I believe that the golden days peaked shortly after the start of the 21st century. I would marvel that computers always now seemed to connect to and sync to projectors. First with wired Ethernet and then with the emergence of wifi, we were no longer at the mercy of telephone modems and noisy phone lines (or needing to find Internet cafes). We could connect to our email and institutional servers without make long-distance phone calls. We could even talk on the phone from almost anywhere via Skype.

But now we seem to be regressing. Newer projectors have more resolution, but don’t always work with older computers. Dongle hell is worst for the Mac, but the PC world is not immune. We have multiple connector types for projects, and different types of USB. An additional wrinkle is the need for encryption, and we can move files that pose very small risk (e.g., Powerpoint presentations) easily from where we create them to where we need to present them.

Technology still is marvelous when it works seamlessly. Hopefully the proliferation of cables, connectors, and security protocols will not make the golden era a distant memory