AnnouncementsMatrixEventsFunnyVideosMusicAncapsTechnologyEconomicsPrivacyGIFSCringeAnarchyFilmPicsThemesIdeas4MatrixAskMatrixHelpTop Subs
5

Rumble has made efforts to prevent servers from "scraping" normal information off of a page. It's pretty normal to grab meta data off of a page through a number of competing meta tag schemes. But to do that you have to get the html content to parse.

I suppose maybe as a counter measure to negative scraping they've made it so default curl or node-fetch will just get served looping redirects. The browser doesn't get served these redirects.

Of course there is always a way to more effectively simulate a browser and get the content you want. I "The Scrapist" certainly know how to do that. But I decided if they want to be a bitch I'll use my scrapist skills and code I've already written to rewrite their url to one that live scrapes and redirects the mp4.

If Rumble wants sites to link out to them they can make it easy for those sites to get metadata. Otherwise I'll just rip their mp4, or better yet, still make them serve it.

Comment preview

If you can make it work that would be great, but I can just put the links in text posts it’s all good.

[-]x0x72(+2|0)

True. Just from my end it makes sense that if someone does submit a Rumble link as a link post it should do something.

I really should make Rumble links submit as Rumble links if I was doing things properly. I could get the meta data if I really wanted / kind of already am because the code that does the mp4 redirect is able to get metadata.

But I prefer the fuck you solution ATM if I'm being honest. And I suppose I also wanted to make this post to say BTW in some ways this hasn't been a GoatMatrix bug that the Rumble links don't work and is kind of a Rumble bug as they are going out of their way to make a very normal way to code something not work.

It’s all Good man