• 0 Posts
  • 10 Comments
Joined 11 months ago
cake
Cake day: October 27th, 2023

help-circle




  • Don’t be sad, it’s just a part of how things are you just have to choose a method and stick to it.

    I personally use Notion. I’ve created a database and added properties like date, venue, authors, organizations, etc.

    For example, the other day I needed to recap what the BLIP paper was about so I just searched the paper in the database and took a look at the page. On that page I’ve highlighted different text with different colors depending on when I came back to read it.

    Took me a while to get this working and into the habit of it though.




  • TL;DR The more constraints on the model, the more time should spend analyzing your data and formulating your problem.

    I’ll agree with the top comment. I’ve also had to deal with a problem at work where we were trying to perform product name classification for our e-commerce product. The problem was that we couldn’t afford to have anything too large or increase infrastructure costs (i.e., if possible we didn’t want to use any more GPU computing resources than we already were).

    It turns out that extensive EDA was what saved us. We were able to come up with a string-matching algorithm sophisticated enough that it achieved high precision with practically no latency concerns. Might not be as flexible as something like BERT but it got the job done.


  • TL;DR The more constraints on the model, the more time should spend analyzing your data and formulating your problem.

    I’ll agree with the top comment. I’ve also had to deal with a problem at work where we were trying to perform product name classification for our e-commerce product. The problem was that we couldn’t afford to have anything too large or increase infrastructure costs (i.e., if possible we didn’t want to use any more GPU computing resources than we already were).

    It turns out that extensive EDA was what saved us. We were able to come up with a string-matching algorithm sophisticated enough that it achieved high precision with practically no latency concerns. Might not be as flexible as something like BERT but it got the job done.