34 slides extracted.
Slide 1 — 0:08 (watch)
![]() | Thank you for attending my talk. |
Slide 2 — 0:22 (watch)
![]() | My name is Adrian Bertagnoli, and I am a founding engineer at Callosum. Today, I will discuss scaling the next paradigm of heterogeneous intelligence. |
Slide 3 — 0:32 (watch)
Slide 4 — 0:46 (watch)
![]() | To provide an intuition about heterogeneous intelligence, I will first explain the current prevailing paradigm of homogeneous intelligence. |
Slide 5 — 1:04 (watch)
![]() | Homogeneous intelligence in AI primarily refers to scaling single models across a fleet of identical chips. |
Slide 6 — 1:36 (watch)
Slide 7 — 2:12 (watch)
![]() | Given that we are currently experiencing mild heterogeneity, how can we envision a greater level of heterogeneity? What will that look like? Initially, we are in a state of mild heterogeneity. |
Slide 8 — 2:52 (watch)
Slide 9 — 3:52 (watch)
Slide 10 — 4:52 (watch)
Slide 11 — 5:40 (watch)
Slide 12 — 6:08 (watch)
Slide 13 — 6:38 (watch)
Slide 14 — 7:08 (watch)
Slide 15 — 7:30 (watch)
![]() | If you're performing a needle-in-a-haystack task, the information requirement remains constant, regardless of the size of the prompt. |
Slide 16 — 7:44 (watch)
Slide 17 — 8:00 (watch)
Slide 18 — 8:44 (watch)
Slide 19 — 9:12 (watch)
![]() | Here are the results on the Oolong benchmark, which is the benchmark referenced in the paper. |
Slide 20 — 9:50 (watch)
Slide 21 — 10:28 (watch)
![]() | The next problem we wanted to address is visual web navigation. |
Slide 22 — 10:36 (watch)
Slide 23 — 10:50 (watch)
![]() | We approached the problem by recognizing its heterogeneous nature rather than treating it as homogeneous. |
Slide 24 — 11:04 (watch)
![]() | The problem decomposes into multiple steps involving visual reasoning and textual reasoning. Each of these subcomponents requires different models to be successfully completed. |
Slide 25 — 11:16 (watch)
![]() | Here, we observe a fundamental shift in the Pareto frontier, where singular models like Kimi K2.5 and GPT-5.2 are outperformed by a heterogeneous set of models. |
Slide 26 — 11:50 (watch)
Slide 27 — 12:30 (watch)
![]() | For these sub-tasks alone, we are 11 times faster and 43 times cheaper than using ChatGPT. This contributes to our overall performance, making us 3.7 times cheaper and three times faster. |
Slide 28 — 12:50 (watch)
Slide 29 — 13:08 (watch)
![]() | The third paradigm of compute will be heterogeneous, focusing on mapping multi-agentic workloads optimally onto different chips. We are collaborating with ARIA, the UK institute. |
Slide 30 — 13:22 (watch)
![]() | We received a £3 million grant to operate the first heterogeneous co-located cluster in the UK. Our goal is to make a significant impact and lead this new era of innovation. |
Slide 31 — 13:38 (watch)
Slide 32 — 13:52 (watch)
![]() | This is the worst our infrastructure will ever be. Thank you. |
Slide 33 — 14:24 (watch)
Slide 34 — 14:54 (watch)
![]() | My name is Adrian Bertagnoli. If anyone is interested, we are hiring. Thank you very much. |

































