More

bronxbomber92 · 2025-06-20T03:21:58 1750389718

A question for the author(s) since they seem to be very responsive to this thread :).

1. How fine grain is each task? In a traditional matrix multiplication kernel, for example, each thread block is responsible for a small output tile of the resulting matrix. In Mirage's mega kernel, would there correspondingly be a task for each small output tile?

2. How does the Mirage compiler form the task graph? Does it have domain knowledge of every operator's data flow at the granularity of individual elements? Again taking matmul as an example: a given output output tile requires the correspond M_BLOCK rows of the A matrix. If the A matrix was itself an output of a prior matmul (+ nonlinearity), the dependees would be all of output tile tasks corresponding to those M_BLOCK rows of the operator that produced A?

zhihaojia · 2025-06-20T04:08:22 1750392502

1. In MPK, each task is mapped to an individual SM. The amount of work handled by a task is similar to that of a thread block in the traditional kernel-per-operator approach.

2. TL;DR: MPK automatically analyzes inter-task dependencies by tracking the input and output tensors associated with each task. A longer version: Longer version: MPK uses imap, omap, and fmap (see Section 2 of the Mirage paper) to determine each task’s input and output tensors. A dependency is introduced between task A and task B if A produces any tensor elements that B consumes—that is, if A's outputs overlap with B's inputs.

> Again taking matmul as an example: a given output output tile requires the correspond M_BLOCK rows of the A matrix. If the A matrix was itself an output of a prior matmul (+ nonlinearity), the dependees would be all of output tile tasks corresponding to those M_BLOCK rows of the operator that produced A?

Exactly. In this case, all output tile tasks that consume those M_BLOCK rows of A will depend on all tasks responsible for producing the corresponding parts of A in the previous operator.

bronxbomber92 · on Feb 6, 2025

An example of “AST via reflection”

https://docs.scala-lang.org/scala3/reference/metaprogramming...

bronxbomber92 · on Jan 21, 2025

What are the hero use cases for AI agents? Reading the website, I don't see a good demonstration of what "AI agents" are and what they're good for.

moekatib · on Jan 21, 2025

AI agents excel in tasks like automating workflows, handling complex decision trees, and managing multi-step processes across APIs. For example, an agent can monitor a sales pipeline, send follow-ups, update CRMs, or manage logistics autonomously. See our demo here: https://github.com/picahq/onetool-demo.

williamstein · on Jan 21, 2025

This particular Oxide and Friends podcast includes a fantastic discussion of AI agents, the problems with their definitions, and at least one promising use case: https://oxide-and-friends.transistor.fm/episodes/predictions...

bronxbomber92 · on June 13, 2023

I believe this post is referring to device-scoped memory barriers - also sometimes called fences - as opposed to execution barriers.

The former being a mechanism to ensure memory accesses follow a well defined order (e.g. it'd be bad if the memory accesses executed inside a critical section could be reordered before or after the lock and unlock calls).

The latter being a mechanism that ensures all threads (within some scope, perhaps all threads running on the "device") reach the same point in the program before any are allowed to proceed.

raphlinus · on June 13, 2023

That's correct, it's the memory scope that I expect to be device-scoped. GPUs tend not to have execution barriers in the shader language beyond workgroup scope; generally the next coarser granularity for synchronization is a separate dispatch. However, single-pass prefix sum algorithms, including decoupled look-back, can function just fine with device-scoped memory barriers, and do not require execution barriers with coarser scope than workgroup.

samus · on June 14, 2023

The post also mentions unspecified behavior (mixing atomic and non-atomic memory accesses) where everybody has to cross their fingers and hope that the hardware designers had the same idea about how it should work. Which is almost fine with enough test coverage, but a shader translation layer adds uncomfortable complexity on top of it.

bronxbomber92 · on Aug 29, 2022

Threads are also a tool for concurrency.

titzer · on Aug 30, 2022

And concurrency can be a tool to implement threads (albeit, with no parallel speedup).

bronxbomber92 · on May 11, 2022

The author states it wasn't actually work-life balance that was making him happy and tired. Rather, he discovered:

> By mid-2021 I was tired all the time. I know I wasn’t alone, because it was an ongoing meme inside Google2. It’s only now that I realize what was wrong: I missed the satisfaction of building things and finishing projects.

PragmaticPulp · on May 11, 2022

This is the classic justification that leads people to self-defeating workaholism: The idea that you can fill the voids in your life by just working harder.

The false dichotomy is the idea that the alternative to Google is to work more hours + evenings + weekends at a startup. He's replacing one problem with another, but this new problem feels fresh and new and like turning over a new leaf. At least for now.

ImprovedSilence · on May 11, 2022

I get what he’s saying though. There can be great joy and a positive feeling of “losing yourself” in your work when you actually get to create. I think his role and and the internal bureaucracy prevented him from using that creative energy.

bravetraveler · on May 11, 2022

Indeed, finally fixing something that may have been a thorn in your side for years is a very rewarding thing.

I've worked myself a little extra with a few of these, and I'm glad for it.

I've found a tendency in my line of work/colleagues to call patchwork acceptable, and it regularly comes back to haunt us.

We find a way to stop the 3AM calls and stop there.. neglecting that maybe one person has the necessary context and it may grow out of control again

theptip · on May 11, 2022

I don’t think it makes you a workaholic to observe that a shit work environment drains your energy and burns you out, whereas a good one can leave you feeling energized.

They weren’t saying that they needed to work harder at Google to be happy, they were saying they needed to move somewhere else where they could get job satisfaction from completing projects.

lampshades · on May 11, 2022

I mean, I'm the same way. I look back at my life and the times I didn't create things of value seem so meaningless. I don't want to go back to creating meaningless things. Even if I'm working harder, I'm enjoying what I'm doing.

The article really hit home for me, personally.

freedom2099 · on May 11, 2022

I enjoy my work… But it is not the thing that gives meaning to my life!

vasco · on May 11, 2022

It must be nice to be so sure about how others should live their lives.

nnoitra · on May 11, 2022

This is a bizarre non sequitur.

bronxbomber92 · on Jan 11, 2022

This is the same for any specialized software engineering role. Compilers, GPGPU, embedded systems, computer graphics, image processing, etc. In an interview panel for any of these roles, you will be expected to be a competent software engineer and have domain knowledge about the sub-field.

bronxbomber92 · on May 3, 2021

Do you have references to the incident(s) or published work(s)?

bronxbomber92 · on Oct 16, 2020

> The players are traded freely among teams Yes, this happens more often than in the past, but there are still many players and staff that stay with an organization for an extended period of time (e.g. Tom Brady and Bill Belichick).

> all teams are owned by the same business What do you mean? Each team has a different owner.

> So for example the San Francisco 49ers have very little to do with San Francisco, except for the name' In many geographical regions, you grow up watching the team that is closest in proximity. e.g. I grew up in rural western New York State and I grew up watching the Buffalo Bills because they were the closest franchise despite being 100 miles away.

The name is just a semantic. Would you be happier if the 49ers were called the "Northern California 49ers"?

JacobDotVI · on Oct 17, 2020

There is an American baseball team that started their life as Florida Marlins. The thinking was that they would establish this tribe mentality for all of Florida. After years of not great results in that regard they changed their name to the Miami Marlins for the same reason - to better establish a tribe. They haven’t moved locations, they are just re-targeting their brand

bronxbomber92 · on Sept 26, 2020

My impression was that the iTunes stunt was not received well because they had already lost popularity. The iTunes stunt didn't help, but I doubt it materially hurt their popularity either.

tyingq · on Sept 26, 2020

To me, it exposed U2 to a bunch of people that had never heard of them, in a very bad light. Killing off any chance of new fans. Not that they would have grown in popularity, but that the trail off got much steeper.

ilamont · on Sept 26, 2020

I agree. Other things are going on. Rock has long been in decline, the social messages and trends U2 focused on 30-40 years ago have passed, and U2 is no longer breaking new musical ground like they were from the late 70s until Pop.

CydeWeys · on Sept 26, 2020

As a lifelong rock fan, it's really been sad to live through the long slow inexorable decline of the genre. The vast majority of current popular music doesn't appeal to me at all, which simply wasn't true even as recently as the 90s.

ilamont · on Sept 26, 2020

I think the machine that leveraged interest and controlled attention for so long through radio, magazine, videos, and festivals has basically failed and young people have turned away from guitar-based rock.

My teenaged kids have no interest in what I listen to, with the possible exception of Queen, and I think that has more to do with the biopic than organic discovery.

CydeWeys · on Sept 26, 2020

I won't argue that young people are turning away from guitar-based rock, but I'm not seeing the connection between that and the changes in the popular music machine. Are you implying that people would not have liked guitar-based rock this entire time if not for said machine? That the natural state of things, which we're now reverting to, is to prefer other types of music? I think it's more just that tastes change and go through cycles, and guitar-based rock is on the decline (and might or might not recover; there are so many dead dead dead musical genres and instruments out there).