article thumbnail

Presentation: Modern Compute Stack for Scaling Large AI/ML/LLM Workloads

InfoQ

Jules Damji discusses which infrastructure should be used for distributed fine-tuning and training, how to scale ML workloads, how to accommodate large models, and how can CPUs and GPUs be utilized? By Jules Damji

Tuning 89
article thumbnail

What is IT automation?

Dynatrace

Automating IT practices without integrated AIOps presents several challenges. While automating IT processes without integrated AIOps can create challenges, the approach to artificial intelligence itself can also introduce potential issues. The challenges of automating IT and how to combat them. Developing automation takes time.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

QCon London: Lessons Learned From Building LinkedIn’s AI/ML Data Platform

InfoQ

He specifically delved into Venice DB, the NoSQL data store used for feature persistence. The presenter shared the lessons learned from evolving and operating the platform, including cluster management and library versioning. By Rafal Gancarz

article thumbnail

The death of Agile?

O'Reilly

The most important is discovering how to work with data science and artificial intelligence projects. Large teams present their own problems, but it’s ironic to see writers scorning the “two pizza group” concept because it can’t possibly work for large organizations. Can Agile work for large teams?