Category For Dev’s

MedFuzz: Exploring the robustness of LLMs on medical challenge problems

MedFuzz: Exploring the robustness of LLMs on medical challenge problems

Large language models (LLMs) have achieved unprecedented accuracy on medical question-answering benchmarks, showcasing their potential to revolutionize healthcare by supporting clinicians and patients. However, these benchmarks often fail to capture the full complexity of real-world medical scenarios. To truly harness…

Introduction to Phi-3

Introduction to Phi-3

In this article, we will explore the Phi-3 model from the official technical report published by Microsoft. Figure 1. Phi-3 family of models (source: ). Language models, especially the Small Langauge Models (SLMs) have come a long way in the…

Combined aggregations for efficient analysis

Combined aggregations for efficient analysis

Dealing with massive amounts of data is not always straightforward. Often, a data scientist needs to trim down a large data table into a handful of key components that can be used for further analysis. For example, we may want…

Deephaven and Iceberg | Deephaven

Deephaven and Iceberg | Deephaven

Deephaven is a powerful analytics engine that makes processing large data more intuitive than ever. Iceberg is a table format that provides fast, efficient, and scalable data storage. Combining the two is like bringing Holmes and Watson together to solve…

Collaborators: Silica in space with Richard Black and Dexter Greene

Collaborators: Silica in space with Richard Black and Dexter Greene

[TEASER ENDS]  GRETCHEN HUIZINGA: You’re listening to Collaborators, a Microsoft Research Podcast showcasing the range of expertise that goes into transforming mind-blowing ideas into world-changing technologies. I’m Dr. Gretchen Huizinga. [MUSIC FADES]  Today I’m talking to Dr. Richard Black, a…