Skip to content

Making language models accessible to all

Video by Sam Han | Text by Ignacio Lobos | UW-IT Communications & Engagement
Tim Dettmers, center. Artidoro Pagnoni, right.
Tim Dettmers, center. Artidoro Pagnoni, right.

Something that easily gets lost when people talk about working with large language models such as ChatGPT is cost and the unbelievable amount of power needed to run them.

That’s where Tim Dettmers, a Ph.D. student at the University of Washington, comes in. His work is focused on machine learning and large language models, with the ultimate goal of making them more widely accessible to a larger field of researchers.

There are plenty of bright people all over the U.S. and the world who have a lot to contribute to the emerging language model field, but often they lack access to computational resources to do their own research.

But Dettmers has found a way to allow researchers to work with these large models on a simple laptop — and perhaps not too distant into the future, allowing you and I to train our own language models on our smart phones.

Learn what he is doing on this video about his work, and the future of language model research.

“What our research is showing is that you don’t need the expensive servers that can be like $50,000 or more expensive” to work with the largest of the language models now available, Dettmers said. “You can use a consumer (computer) so people can set up at home and use these things at home.”

But their work also extends beyond merely using open source language models. “You can take it and make it your own. You can personalize it, you can fine tune it on your data. This is very powerful,” he said.

“I want to make our work as widely accessible as possible for the people with the least resources … and accelerate research so we can figure out more things about language models and how to use them well,” he said.

Partnering with UW-IT to make it happen

Dettmers and his colleagues have been doing their research on Hyak, the University’s own supercomputer, which is managed by UW-IT. Hyak, Dettmers said, has accelerated their work and made it possible to use large language models — and to personalize them, which is an even more difficult undertaking.

“Research computing at the University of Washington is just a game changer,” Dettmers said. “We need so much computational resources in our research. Hyak (makes it possible) to do the work that we need to do in order to stay at the cutting edge.”

 

UW Tacoma’s BioDepot makes analyzing biomedical data a snap

 

Video by Sam Han | Text by Ignacio Lobos | UW-IT Communications & Engagement

When Ka Yee Yeung, professor at the School of Engineering and Technology at UW Tacoma, was looking for computing power to ignite her lab work, she turned to UW-IT and the e-Science Institute for help.

“Modern day biology is very much a data-rich science, and a lot of modern day technologies like sequencing technology, like microscopes and all of that, generate really large datasets,” she said.

So finding a way to work with so much data is essential to advancing the work of biomedical researchers. Her team partnered with UW-IT and e-Science Institute computing experts to find suitable cloud computing solutions and even cloud credits that make it less expensive for her graduate students to conduct their cloud computing work.

The collaboration has allowed Yeung and her lab to develop computationally optimized methods and software tools to make it easier for biologists and clinicians to analyze biomedical data.

In the past year, they have been working to develop a platform called the BioDepot Workflow Builder. The open source platform, funded by the National Institutes of Health, allows researchers to build bioinformatics workflows by combining interchangeable and encapsulated widgets. That process allows researchers to more easily implement and test new algorithms and observe how output differ.

“Our goal is to make it easier for our biologists and clinicians to interactively and reproducibly run their analytical workflows,” she said.

Reproducible analysis of data is a key part of the research process, particularly when big datasets are involved.

“A lot of the dataset we work with are what we call sequencing data, which looks at genetic variations across different people or patient samples. Reproducible analysis of data is very essential,” Yeung said.

With BioDepot, her team is working to address these issues, making research data reproducible — and making it easier for researchers to interact with the data without training in programing.

For more information and to hear Yeung talk about her work, watch this UW-IT video.