[P] Comgra: A library for debugging and understanding neural networks (alien.top)

submitted 2 years ago by Smart-Emu5581@alien.top to c/machinelearning@academy.garden

I'm a machine learning engineer and researcher. I got fed up with how difficult it is to understand why neural networks behave the way they do, so I wrote a library to help with it.

Comgra (computation graph analysis) is a library you can use with PyTorch to extract all the tensor data you care about and visualize it graphically in a browser.

This allows for a much more detailed analysis of what is happening than the usual approach of using TensorBoard. You can inspect tensors as training proceeds, drill down into individual neurons, examine individual data sets that are of special interest to you, track gradients, compare statistics between different training runs, and more.
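For readers who have not done this kind of inspection before, here is a minimal sketch of the general idea of pulling intermediate tensors and gradient statistics out of a network, written with plain PyTorch forward hooks. It is not comgra's API: `attach_recorders`, `records`, and the toy model are hypothetical names made up for illustration only.

```python
import torch
import torch.nn as nn


def attach_recorders(model, records):
    """Register forward hooks that store summary statistics of each leaf layer's output.

    Hypothetical helper for illustration; not part of comgra or PyTorch.
    """
    handles = []
    for name, module in model.named_modules():
        if len(list(module.children())) > 0:
            continue  # skip containers, record leaf layers only

        # Bind `name` as a default argument so each hook keeps its own layer name.
        def hook(mod, inputs, output, name=name):
            out = output.detach()
            records.setdefault(name, []).append({
                "shape": tuple(out.shape),
                "mean": out.mean().item(),
                "abs_max": out.abs().max().item(),
            })

        handles.append(module.register_forward_hook(hook))
    return handles


torch.manual_seed(0)
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))  # toy network

records = {}
handles = attach_recorders(model, records)

x = torch.randn(16, 4)           # a batch of 16 random inputs
loss = model(x).pow(2).mean()    # arbitrary scalar loss, just to get gradients
loss.backward()

# Gradient norm per parameter, collected after the backward pass.
grad_norms = {n: p.grad.norm().item() for n, p in model.named_parameters()}

# Remove the hooks so later forward passes are not recorded.
for h in handles:
    h.remove()

for layer, stats in records.items():
    print(layer, stats[0])
print(grad_norms)
```

Calling this once per training step and saving `records` and `grad_norms` gives a crude version of the per-layer history that a dedicated tool would then let you browse and compare across runs.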

This tool has saved me a ton of time in my research by letting me check my hypotheses much more quickly than normal and by helping me understand how the different parts of my network really interact.

I hope this tool can save other people just as much time as it did me. I'm also open to suggestions on how to improve it further: since I'm already gathering and visualizing a lot of network information, adding more automated analysis would not be much extra work.

[–] Smallpaul@alien.top 1 points 2 years ago (1 children)

Is there a subreddit for Mechanistic Interpretability? Should there be?

[–] DigThatData@alien.top 1 points 2 years ago (1 children)

this isn't mechanistic interpretability, it's debugging.

[–] Smart-Emu5581@alien.top 1 points 2 years ago (1 children)

> Mechanistic Interpretability

It's primarily intended for debugging, but it can also help with mechanistic interpretability. Being able to see the internals of your network for any input and at different stages of training can help a lot with understanding what's going on.

[–] currentscurrents@alien.top 1 points 2 years ago

IMO interpretability and debugging are inherently related. The more you know about how the network works, the easier it will be to debug it.