this post was submitted on 23 Nov 2023

Machine Learning


I've been diving a lot deeper into some interesting neural network papers recently, and I'd like to implement some of the models they describe. I know that many papers include code, or that I can just google an implementation, but I want to push myself to implement from scratch more often.

Could anyone offer some tips on how they got started or gained the skills to be able to implement a model effectively within a few hours? Any advice would be much appreciated!

top 8 comments
[–] DollyNorman@alien.top 1 points 11 months ago (1 children)

I've found https://paperswithcode.com/ and github search & topics to be useful tools. In my experience, implementing a paper from scratch is a fantastic way to gain a deeper understanding of it. Don't be discouraged: I can't remember a single paper I've tried to implement that provided all the details necessary to implement it.

[–] Avistian@alien.top 1 points 11 months ago (1 children)

Out of curiosity, what do you do when you are missing some key information?

[–] DollyNorman@alien.top 1 points 11 months ago (2 children)

I think this is why implementing is such a useful learning tool. The papers I tried to implement I probably read cover-to-cover 10 or 20 times, as opposed to just skimming the abstract, method, and results.

When missing key info, after searching the paper a few times:

  • my first step was to see if I could find anything on paperswithcode or github.
  • failing that, google searches, to see if I could find anything on forums, stack exchange sites, reddit.
  • uni library or academic paper web search engines
  • see if any papers that cite the paper I'm implementing give some clue
  • last resort was to look into papers they cite.

Those were all the tricks I had; keen to hear of any more.

[–] DryArmPits@alien.top 1 points 11 months ago

You can also contact the authors to ask them. Most of us are not monsters and will happily talk about our work. :)

[–] Ifkaluva@alien.top 1 points 11 months ago (1 children)

What do you do if you find a reference implementation? Just run it? Try to implement from scratch?

[–] DollyNorman@alien.top 1 points 11 months ago

I tried to copy as much as possible from the paper, then fill in the blanks with how I would have solved it. I only had a partial solution but I still learnt heaps more than just reading the paper.

[–] Nique_Eau@alien.top 1 points 11 months ago

i found it helpful to read related papers and code, break down the model into smaller components, and slowly build it up. sometimes you just gotta experiment and fill in the gaps as you go.
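As a rough illustration of the "break it into components and build it up" approach, here is a minimal sketch. It assumes a tiny two-layer MLP as a stand-in for whatever model a paper describes; the function names (`linear`, `relu`, `softmax`, `init_linear`, `mlp_forward`) and the layer sizes are hypothetical and not taken from any particular paper. The point is only that each piece gets its own small check before the pieces are combined.

```python
import math
import random


def linear(x, w, b):
    """Affine map y = W x + b for one input vector x (a list of floats)."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]


def relu(v):
    """Element-wise max(0, v)."""
    return [max(0.0, a) for a in v]


def softmax(v):
    """Softmax with the max subtracted first for numerical stability."""
    m = max(v)
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]


def init_linear(n_in, n_out, rng):
    """Small random weights and zero biases (an arbitrary choice for this sketch)."""
    w = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b


def mlp_forward(x, params):
    """Compose the already-checked pieces: linear -> relu -> linear -> softmax."""
    (w1, b1), (w2, b2) = params
    hidden = relu(linear(x, w1, b1))
    return softmax(linear(hidden, w2, b2))


# Step 1: check each component alone on inputs small enough to verify by hand.
assert linear([1.0, 2.0], [[1.0, 0.0], [0.0, 1.0]], [0.5, -0.5]) == [1.5, 1.5]
assert relu([-1.0, 2.0]) == [0.0, 2.0]
assert abs(sum(softmax([1.0, 2.0, 3.0])) - 1.0) < 1e-9

# Step 2: only then assemble the full model and check shapes and invariants.
rng = random.Random(0)
params = [init_linear(4, 8, rng), init_linear(8, 3, rng)]
probs = mlp_forward([0.5, -1.0, 2.0, 0.0], params)
assert len(probs) == 3
assert abs(sum(probs) - 1.0) < 1e-9
```

With a real paper the components would be whatever blocks the method section describes, and the hand-checkable tests are where gaps in the paper tend to show up first.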

[–] brainx98@alien.top 1 points 11 months ago

I am also in that position. I will be reading the comments. What I am currently doing is building most of the common models from scratch.