overview for CatalyzeX_code

[R] Hierarchically Gated Recurrent Neural Network for Sequence Modeling in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 11 months ago

Found 1 relevant code implementation for "Hierarchically Gated Recurrent Neural Network for Sequence Modeling".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Adjusting Probability distribution Using Speculative Decoding [D] in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 11 months ago

No relevant code picked up just yet for "Fast Inference from Transformers via Speculative Decoding".

Request code from the authors or ask a question.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] Understanding the loss function of Diffusion Probablistic models vs Denoising Diffusion Probablistic Models in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 11 months ago

Found 2 relevant code implementations for "Deep Unsupervised Learning using Nonequilibrium Thermodynamics".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

Found 6 relevant code implementations for "Denoising Diffusion Probabilistic Models".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[P] fMRaI: Neural network interpretability and explainability library in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 11 months ago

Found 1 relevant code implementation for "What Does BERT Look At? An Analysis of BERT's Attention".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

Found 2 relevant code implementations for "Transformer Feed-Forward Layers Are Key-Value Memories".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[P] Distil-Whisper: a distilled variant of Whisper that is 6x faster in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 11 months ago

Found 4 relevant code implementations for "Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] Beyond U: Making Diffusion Models Faster & Lighter in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 11 months ago

No relevant code picked up just yet for "Beyond U: Making Diffusion Models Faster & Lighter".

Request code from the authors or ask a question.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 1 year ago

No relevant code picked up just yet for "Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models".

Request code from the authors or ask a question.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[D] Logit Laplace Reconstruction Loss in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 1 year ago

Found 2 relevant code implementations for "Zero-Shot Text-to-Image Generation".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] MADLAD-400 - 4.6 / 2.6 trillion token dataset covering 419 languages + translation models up to 10.7B parameters in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 1 year ago

Found 2 relevant code implementations for "MADLAD-400: A Multilingual And Document-Level Large Audited Dataset".

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] MADLAD-400 - 4.6 / 2.6 trillion token dataset covering 419 languages + translation models up to 10.8B parameters in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 1 year ago

Found 2 relevant code implementations for "MADLAD-400: A Multilingual And Document-Level Large Audited Dataset".

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] CogVLM: Visual Expert for Pretrained Language Models in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 1 year ago

Found 1 relevant code implementation for "CogVLM: Visual Expert for Pretrained Language Models".

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

[R] Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning in c/machinelearning@academy.garden

[–] CatalyzeX_code_bot@alien.top 1 points 1 year ago

Found 2 relevant code implementations for "Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.