umami_wasbi

joined 1 year ago
[–] umami_wasbi@lemmy.ml 5 points 22 hours ago* (last edited 22 hours ago) (1 children)

I don't have a single guide for you, but I can lay out a road map.

  1. A programming language. I prefer Python.
  2. Basic HTML syntax and CSS selectors
  3. HTTP, specifically methods, status codes (no need to memorize them all cuz you can look them up), and cookies
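To make item 3 a bit more concrete, here is a rough sketch using only Python's standard library. It shows how to look up a status code instead of memorizing it, and what a cookie looks like once parsed. The helper names `describe_status` and `parse_set_cookie`, and the sample cookie string, are made up for illustration.

```python
from http import HTTPStatus
from http.cookies import SimpleCookie


def describe_status(code: int) -> str:
    """Return a short description such as '404 Not Found'."""
    status = HTTPStatus(code)  # raises ValueError for unknown codes
    return f"{status.value} {status.phrase}"


def parse_set_cookie(header_value: str) -> dict[str, str]:
    """Parse the value of a Set-Cookie header into {name: value}."""
    cookie = SimpleCookie()
    cookie.load(header_value)
    # Attributes such as Path or HttpOnly live on the morsel, not in the value.
    return {name: morsel.value for name, morsel in cookie.items()}


if __name__ == "__main__":
    for code in (200, 301, 404, 500):
        print(describe_status(code))
    # Hypothetical header value, as a server might send it.
    print(parse_set_cookie("session=abc123; Path=/; HttpOnly"))
```

The common methods you will meet first are GET (fetch a page) and POST (submit a form); the rest can wait until you need them.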

Once you have those foundations ready, you can go on and try to build a web scraper. I advise against using Scrapy, not because it is bad, but because it is too overwhelming and abstracted for a beginner. Instead, I advise you to use requests for HTTP and BeautifulSoup4 for HTML parsing. You will build a more solid foundation and can transition to Scrapy later, when you need its advanced features.
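A minimal sketch of what that combination can look like, assuming `requests` and `beautifulsoup4` are installed (`pip install requests beautifulsoup4`). The function names, the sample HTML, and the CSS selector are only placeholders for illustration; a real site will need its own selector.

```python
import requests
from bs4 import BeautifulSoup


def fetch(url: str) -> str:
    """Download a page and return its HTML, failing loudly on 4xx/5xx."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    return response.text


def extract_links(html: str, selector: str = "a[href]") -> list[tuple[str, str]]:
    """Return (link text, href) pairs for every element matching the selector."""
    soup = BeautifulSoup(html, "html.parser")
    return [(a.get_text(strip=True), a["href"]) for a in soup.select(selector)]


if __name__ == "__main__":
    # Inline sample so the parsing part runs without touching the network.
    sample = '<ul><li><a href="/first">First</a></li><li><a href="/second">Second</a></li></ul>'
    for text, href in extract_links(sample):
        print(text, href)
    # With a real page it would be: extract_links(fetch("https://example.com"))
```

Keeping the download and the parsing in separate functions lets you save a page once and practice your selectors offline, without hitting the site again and again.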

When you get stuck, don't be afraid to pause your attempt and read the tutorials again. Head to the Python Community on Discord to get interactive help. We welcome noobs, as we were once noobs too. Just don't ever mention scraping there, as they can't help if they suspect you're trying to do something inappropriate, malicious, or illegal. They are notoriously against yt-dlp, which frustrates me a bit. Phrase it nicely and in a generic way. I will be there occasionally offering help.

[–] umami_wasbi@lemmy.ml 9 points 1 day ago (3 children)

There is no simplification of the kind you're looking for. It seems you don't have a programming background. If you really need to scrape something, you need to learn a programming language, HTTP, HTML, and maybe JavaScript. AFAIK, there is no easy way or point-and-click scraper-building tool. You will need to invest time and learn. Don't worry, you should be able to get it done in 2-3 months if you do invest the time.

[–] umami_wasbi@lemmy.ml 2 points 1 day ago

It is an OK tool to get things started.

[–] umami_wasbi@lemmy.ml 2 points 5 days ago

Starting Out with Python by Tony Gaddis

I read the 3rd edition in the library; now it's on its 6th. I don't know if it is as good as I remember.

[–] umami_wasbi@lemmy.ml 2 points 1 week ago (4 children)

How about torrenting?

[–] umami_wasbi@lemmy.ml 1 points 1 week ago

Oops. Missed that part.

[–] umami_wasbi@lemmy.ml 2 points 1 week ago* (last edited 1 week ago)

I use BTRFS for snapshots and automatic compression. Maybe it can be done with RAID on LVM? AFAIK, BTRFS redundancy is basically the same as traditional RAID, similar to using mdadm. Still, you would want a backup strategy instead of relying on disk redundancy. I learned that the hard way.

[–] umami_wasbi@lemmy.ml 3 points 1 week ago* (last edited 1 week ago) (11 children)

I would just skip RAID, add all the disks to a single BTRFS filesystem, and use the built-in profiles for (meta)data redundancy.

I don't know much about caching, tho.

https://btrfs.readthedocs.io/en/latest/btrfs-device.html

[–] umami_wasbi@lemmy.ml 2 points 1 week ago (6 children)

Is this finally the dusk of SO? It helps a lot, but also sucks a lot.

[–] umami_wasbi@lemmy.ml 39 points 1 week ago* (last edited 1 week ago) (2 children)

Yay, more subscriptions.

👌Adobe, I am sticking to my Affinity Photo 1.

[–] umami_wasbi@lemmy.ml 7 points 1 week ago (1 children)

It is their job to find evidence, not my responsibility to provide it.

[–] umami_wasbi@lemmy.ml 3 points 1 week ago

I'm on S21FE and it does NOT.

 

(Rant)

At some point, HSBC decided that KDE Connect installed via F-Droid is less secure.

Photo of the HSBC UK app urging me to install KDE Connect via GPlay or Galaxy Store

Then it decided that non-whitelisted keyboards are a security risk. Only Gboard and Samsung Keyboard are confirmed to be on the whitelist.

Photo of the HSBC UK app telling me to switch input method citing security risk


I understand the point that risk can be introduced at various points, yet this is simply too much. Yeah, there are people whose phones got infected by malware, but from the Play Store. Not a single time have I heard of that happening with F-Droid distributed apps, at least not from the official repo. Also, I would put more trust in an open source keyboard than in any proprietary keyboard.

Furthermore, I'm shocked that an app can read my app list and current keyboard (introduced in Android 14). This just makes building a profile much easier, as I believe almost everyone has a unique set of apps they like. I don't think any app needs such functionality. Why the f does it need to care what input devices I use? This makes me worry more about untold (aka buried deep in the Privacy Policy) data collection.

25
submitted 2 weeks ago* (last edited 2 weeks ago) by umami_wasbi@lemmy.ml to c/lemmy@lemmy.ml
 

There is "block instance" under "settings > blocks", but what does it do? I added a few instances to the list, but it seems to do nothing to remove content linking to those instances.

What I want to achieve is to block all user posts from specific instances when spam is high.

 

How come this wasn't getting more attention?

 

There are reports in The Register's comment section that Malaysia didn't only redirect DNS traffic, but also took active measures to block VPNs and to MITM DoH, with Cloudflare's DoH returning a local ISP certificate.

In fact, some ISPs like Maxis and Yes were already blocking VPNs (I see a lot of complaints on Lowyat.net about Maxis blocking VPNs, and I was using Yes WiMax and experienced the blocking firsthand. I couldn't connect to PPTP endpoints, and L2TP endpoints caused the modem to disconnect from the network and reboot).

They were outright trying a MITM redirect attack on those using DoH. Many reported error messages saying that Cloudflare's DoH servers were practically returning the certificate for Telekom Malaysia's DNS servers.

Even with many new technologies, I realized that I am not as safe and free as I want to be, and maybe neither are you.

 

If $70 + $10/mo can get me through all those annoying CAPTCHAs, I will gladly pay. Of course, if cheaper or even free solutions exist, I will use them. My only requirement is that it works 90%+ of the time.

 

tl;dr: only applies to the NY Eastern District, and likely only US citizens can benefit

28
submitted 3 months ago* (last edited 3 months ago) by umami_wasbi@lemmy.ml to c/linux@lemmy.ml
 

I want to check if my Lenovo T480 is affected by the recent PKFail, but have no idea how to extract the BIOS firmware for validation. Can someone detail the steps? Thanks.

40
submitted 4 months ago* (last edited 4 months ago) by umami_wasbi@lemmy.ml to c/selfhosted@lemmy.world
 

Just wondering what happens if my mail server goes offline for some period and the sending party can't deliver.

Will there be any consequences other than me not getting the mail? I tried searching, but the results are all from the perspective of a sender getting a bounce, rather than the other way around.

20
submitted 4 months ago* (last edited 4 months ago) by umami_wasbi@lemmy.ml to c/selfhosted@lemmy.world
 

Saw they have a promotion: £1/mo with no setup fee when paying for a 12mo contract on the lowest end VPS. Has anyone used it before?

Just planning to run frp on it. https://github.com/fatedier/frp

 

LOL

 

archive.is

Shall we trust an LM to write legal definitions, of a deepfake in this case? It seems the state rep. is unable to proofread the model output, as he is "really struggling with the technical aspects of how to define what a deepfake was."
