Introducing Bitmagnet: A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent search engine with web UI, GraphQL API and Servarr stack integration

mgdigital@lemmy.world · 1 year ago

Introducing Bitmagnet: A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent search engine with web UI, GraphQL API and Servarr stack integration

spiritedpause@sh.itjust.works · 1 year ago

Dude this is amazing! Exactly the sort of thing I’ve been hoping would pop up to further “decentralize” the torrent search experience.

So I’m trying to run it on my machine through the docker-compose option, and I’m seeing something weird. It shows as successfully running, but when I go to the port it should be running on, I get “unable to connect” on my browser.

When I check my containers running, it shows the 3 bitmagnet containers, but the port doesn’t show.

https://i.imgur.com/D4R1Le5.png

droopy4096@lemmy.ca · 1 year ago

@mgdigital, first thing I’be noticed: reliance on “heavier” database stack (pg + redis), at least from the first glance at docker-compose. My suggestion would be to have an option for minimalist setup with sqlite and without redis if possible. That would work better for those of us flying with minimal hardware (rpi, old PC and such).

Stephen304@lemmy.ml · 1 year ago

A dht crawler is inherently an intensive service to run, magnetico used sqlite and would take 10 minutes just to load the splash page that includes the total count of discovered torrents.

mgdigital@lemmy.world · 1 year ago

Hi, this is a great point and one that I’ve already given consideration to. I’ll address separately the issue of the primary datastore ,i.e. Postgres, and the Redis dependency:

Postgres as the only option for the data store

There are 2 reasons for this:

Performance: while SQLite could offer a simpler/embedded data store, it simply doesn’t have the performance and features of Postgres. Bitmagnet has a faceted search engine and is write-intensive (it will be discovering ~5k torrents per hour and writing these to the database along with associated metadata). As such, its database may not be suitable for running on older hardware. A SQLite adapter, if it was developed, may simply not be up to the job (although as I haven’t attempted this I can’t say what the performance would be like). That said, Bitmagnet itself is not especially resource intensive, you could probably run it on a Raspberry PI but point it to a Postgres instance on some more powerful hardware. At this stage I’ve only been running it on a M2 Mac Mini with Postgres located on its SSD and so would be interested to know people’s mileage on other hardware.
Development, support and maintenance overhead: I’m a lone developer and this project is already too big for one person. A SQLite adapter, if feasible performance-wise, I think could only happen if other contributors joined the project as my to-do list is already pretty long. It would have to achieve feature parity with the Postgres implementation which makes use of several Postgres-specific features and extensions. It would also mean a longer testing cycle and therefore probably a slower release cadence. That said, if there was enough demand and assistance then I’d be open to looking into the feasibility of this once the rest of the application is a little more mature and the current database schema more finalised.

Redis dependency

Redis is currently used only for the asynchronous task queue. I would like to have put this in Postgres, but there simply is not a good out-of-the-box solution that works well with Postgres and GoLang, and is actively maintained. I looked at quite a few queuing libraries and eventually settled on asynq (https://github.com/hibiken/asynq), which is a great library and does the job well - but could really do with support for non-Redis backends.

Using Redis here was a pragmatic decision that allowed me to make progress, rather than an optimal one. I guess I could have built a simple Postgres-based queue myself but that would have been a distraction and probably sub-optimal compared with a mature/separately developed library. It remains an option. Since I looked into this a new project has sprung up which I’m keeping an eye on - https://www.tork.run/ - it has a Postgres backend and looks like it might be up to the job, but is very new.

So yes, I’m very aware that the additional Redis dependency is not ideal and it may well disappear at some point.

Decronym@lemmy.decronym.xyz · edit-2 1 year ago

Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I’ve seen in this thread:

Fewer Letters	More Letters
NAS	Network-Attached Storage
Plex	Brand of media server package
SSD	Solid State Drive mass storage
VPN	Virtual Private Network

4 acronyms in this thread; the most compressed thread commented on today has 9 acronyms.

[Thread #191 for this sub, first seen 5th Oct 2023, 14:25] [FAQ] [Full list] [Contact] [Source code]

prim3r@lemmy.ca · 1 year ago

This looks really cool! How resource intensive is this? What sort of storage requirements are there for this to be a reasonably reliable method of acquiring media? I’m probably just gonna find out myself. I’ve recently fully switched over to usenet, but this could make torrents pretty compelling again.

kautau@lemmy.world · edit-2 1 year ago

As someone interested in Usenet, what’s the best provider and client to start with in your opinion?

prim3r@lemmy.ca · 1 year ago

I’ve been using easynews/nzbgeek/nzbget with an arr stack on debian and it’s worked well for me. I’m fairly new to usenet, so take this with a giant grain of salt.

deafboy@lemmy.world · 1 year ago

Running for 6 days, save_pieces: false

My database is currently 184 GB

pedroapero@lemmy.ml · 1 year ago

Great project !

Naming conventions are missing some important information like bitrate, color depth, and most importantly language and subtitles.

Do you plan to scrape additional infos from known torrent sites (searching for torrent hashes for well named torrents) ?

mgdigital@lemmy.world · 1 year ago

Scraping torrent sites will be avoided is it’ll be prohibitively slow and break the self-sufficiency concept - we’ll infer as much as possible from the torrent meta info alone. You could have a guess at the bitrate from the file sizes. Sonarr/Radarr will already do this for you with quality profiles I think.

Shepy@feddit.uk · 1 year ago

This sounds amazing, definitely going to add this to my servarr setup next few days.

Molecular0079@lemmy.world · 1 year ago

Is it safe to run this without a VPN if I am just using it to index?

ryannathans@aussie.zone · 1 year ago

Does it infiniely crawl, storing all metadata about every torrent it finds forever?

BlueÆther@no.lastname.nz · 1 year ago

seems to work well

just one question, is it expected to have 10,000 out of 12,000 as unknown?

palitu@aussie.zone · 1 year ago

Very cool!

Willdrick@lemmy.world · 1 year ago

This looks kinda neat, I even tore down my whole servarr stack to give it a go, alas I can’t get bitmagnet to “talk” with prowlarr. I’m probably doing something really stupid, but I can’t figure out how to add the whole thing under a single docker network, I get errors like network somename was found but has incorrect label com.docker.compose.network set to ""

726a67@lemmy.sdf.org · edit-2 1 year ago

Looks super interesting; starred!

Will report back once I’ve run through the installation.

LienNoir@lemmy.world · 1 year ago

Hi, am i missing something, the bitmagnet image keep restarting when i check with “docker ps”, the other 2 containers are working as intended. And port 3333 doesn’t show anything.

mgdigital@lemmy.world · 1 year ago

There’s a PR currently open for multi-platform builds so should have this sorted soon

emhl@feddit.de · 1 year ago

What are your logs showing? docker logs -f bitmagnet

LienNoir@lemmy.world · 1 year ago

log: exec /bitmagnet: exec format error

I am on ARM (pi4) maybe it’s the issue

drugo@sh.itjust.works · edit-2 10 months ago

deleted by creator

emhl@feddit.de · edit-2 1 year ago

the parent image should support that arm version, so you could just build the docker image locally on your pi and use that.

Btw. There already is an open pull request to add arm support

LienNoir@lemmy.world · 1 year ago

thanks, for pointing that out, it works great now.

Shdwdrgn@mander.xyz · 1 year ago

Looks like a fun project, but will you be providing any info on setting it up from scratch? I just don’t have an interest in docker containers.

mgdigital@lemmy.world · 1 year ago

Hi, yes this is mentioned on the installation page of the website, below the Docker instructions. The app can be installed Dockerless using go install; if you choose this option you’ll have to provide and configure Postgres and Redis instances for the app to connect to. That said, Docker is the recommended and easiest option.

Shdwdrgn@mander.xyz · 1 year ago

I saw that, but didn’t recognize the ‘go’ command as anything available on Debian. Just did some quick digging though and now I see it’s a new language and I believe I have an idea how to get it installed for compiling so I will give that a shot.

paris@lemmy.blahaj.zone · 1 year ago

Golang v1.0 was released in March of 2012. Not sure I would consider it a new language.

mctoasterson@reddthat.com · 1 year ago

Maybe I’m misunderstanding but wouldn’t it just be easier to use a good private tracker, assuming you can get an invite?

lud@lemm.ee · 1 year ago

Yes, of course.

Introducing Bitmagnet: A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent search engine with web UI, GraphQL API and Servarr stack integration

Introducing Bitmagnet: A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent search engine with web UI, GraphQL API and Servarr stack integration

Home

What is a DHT crawler?

Currently implemented features of Bitmagnet:

Interested?

Postgres as the only option for the data store

Redis dependency