Illustrious-Pay-7516

joined 11 months ago
 

hi, I have ~1G of personal documents that include all documents that I wrote/edited since high school. Most of them are docx/pptx/txt/markdown/pdf, and mostly text with a small fraction of pictures. I wonder if there is a rock solid backup against almost all possible corrupt in my data files? There are not large files (very few photos/videos) so I do not mind using 10x storage space (with huge redundancy to protect against any corruption) to back up data. Any ideas?

 

I read many posts talking about importance of having multiple copies. but the problem is, even if you have multiple copies, how do you make sure that EVERY FILE in each copy is good. For instance, imagine you want to view a photo taken a few years ago, when you checkout copy 1 of your backup, you find it already corrupted. Then you turn to copy 2/3, find this photo is good. OK you happily discard copy 1 of backup and keep 2/3. Next day you want to view another photo 2, and find that photo 2 in backup copy 2 is dead but good in copy 3, so you keep copy 3, discard copy 3. Now some day you find something is wrong in copy 3, and you no longer have any copies with everything intact.

Someone may say, when we find that some files for copy 1 are dead, we make a new copy 4 from copy 2 (or 3), but problem is, there are already dead files in this copy 2, so this new copy would not solve the issue above.

Just wonder how do you guys deal with this issue? Any idea would be appreciated.

[–] Illustrious-Pay-7516@alien.top 1 points 11 months ago (1 children)

do you read it e.g. every few days, to refresh your memory? Or just use it as reference? If it is latter, how does it compare with direct Google search?

[–] Illustrious-Pay-7516@alien.top 1 points 11 months ago (1 children)

yeah that works for me, but there are too many papers out there

 

A few months ago I came across maximum mean discrepancy as a measure of distribution difference, and today I read this term and totally forgot what is means and had to find a youtube video to refresh my understanding. This happens a lot of times in my research. I feel like unless something is really basic (e.g. CNN, cross entropy, etc) and used a lot in my day-to-day model building, I easily forgot what I have read. I wonder is it just because I have a bad memory or I do not have a good way to organize information?

[–] Illustrious-Pay-7516@alien.top 1 points 11 months ago

Can someone help me understand, what does it mean by "smarter than humans"? Do those LLM just read internet text written by humans?