Disrupt 2024
Image Credits: (opens in a new window)
Sponsored

Refining pre- and post-training data strategy for LLM success

This article is presented by TC Brand Studio. This is paid content, TechCrunch editorial was not involved in the development of this article. Reach out to learn more about partnering with TC Brand Studio.

Presented by:

TechCrunch was proud to host TELUS Digital at Disrupt 2024 in San Francisco. Here’s an overview of their Roundtable session.

Large language models (LLMs) have revolutionized AI, but their success hinges on the quality and strategy of the data used.

In this Disrupt Roundtable session, Siddharth Mall, Ian McDiarmid, and Kaushik PS from TELUS Digital dove deep into the critical role of data throughout the LLM life cycle, covering topics such as:

  • Pre-training: How to curate diverse, high-quality datasets that fuel robust and unbiased LLMs.
  • Post-training: Learn about techniques like reinforcement learning and active learning to continually enhance LLM performance and address limitations.
  • Ethical considerations: Navigate the complex landscape of data privacy, bias, and fairness in LLM development.

Speakers

Siddharth Mall, VP Product, TELUS Digital

Ian McDiarmid, Strategy Director, TELUS Digital

Kaushik PS, Senior Director of Product, TELUS Digital