CatalyzeX_code_bot

joined 10 months ago

Found 1 relevant code implementation for "Hierarchically Gated Recurrent Neural Network for Sequence Modeling".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

No relevant code picked up just yet for "Fast Inference from Transformers via Speculative Decoding".

Request code from the authors or ask a question.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 2 relevant code implementations for "Deep Unsupervised Learning using Nonequilibrium Thermodynamics".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

Found 6 relevant code implementations for "Denoising Diffusion Probabilistic Models".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 1 relevant code implementation for "What Does BERT Look At? An Analysis of BERT's Attention".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

Found 2 relevant code implementations for "Transformer Feed-Forward Layers Are Key-Value Memories".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 4 relevant code implementations for "Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

No relevant code picked up just yet for "Beyond U: Making Diffusion Models Faster & Lighter".

Request code from the authors or ask a question.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

No relevant code picked up just yet for "Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models".

Request code from the authors or ask a question.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 2 relevant code implementations for "Zero-Shot Text-to-Image Generation".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 2 relevant code implementations for "MADLAD-400: A Multilingual And Document-Level Large Audited Dataset".

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 1 relevant code implementation for "CogVLM: Visual Expert for Pretrained Language Models".

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.

Found 2 relevant code implementations for "Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning".

Ask the author(s) a question about the paper or code.

If you have code to share with the community, please add it here 😊🙏

--

To opt out from receiving code links, DM me.
