[ home / board list / faq / random / create / bans / search / manage / irc ] [ ]

/hydrus/ - Hydrus Network

Bug reports, feature requests, and other discussion for the hydrus network.

Catalog

See 8chan's new software in development (discuss) (help out)
Infinity Next Beta period has started, click here for info or go directly to beta.8ch.net
Name
Email
Subject
Comment *
File
* = required field[▶ Show post options & limits]
Confused? See the FAQ.
Embed
(replaces files and can be used instead)
Options
Password (For file and post deletion.)

Allowed file types:jpg, jpeg, gif, png, webm, mp4, swf, pdf
Max filesize is 8 MB.
Max image dimensions are 10000 x 10000.
You may upload 5 per post.


New user? Start here ---> http://hydrusnetwork.github.io/hydrus/

File: 1450152218516.png (344.62 KB, 509x600, 509:600, 554jz2K.png)

 No.1575

I have a feature request that I think might be fairly large. I could be wrong

Can we get a feature that allows someone to input an md5 hash into the program, and it scans your collection and tells you if its already in hydrus, perhaps make it runnable standalone along with hydrus, as to make it a little more lightweight.

I run into a LOT of images, that I have no idea if i have or not, and I want to know if i already have the image. If I could grab the md5 and have it scan my hydrus library, i could cut down on my duplicate file saving a lot.

I am considering this as its own program, with its own database that gets generated, but im a novice, so it would take a long ass time to learn how to code something like that, and it might be more useful if it were something already integrated with hydrus

thoughts?

 No.1580

File: 1450203461445.jpg (324.4 KB, 800x500, 8:5, e05df72d00b25b94109ab80863….jpg)

I already cache md5 and some other hashes for local files, so this wouldn't actually be too hard.

The system:hash predicate is something I mostly use for debug and only works for sha256, but I can give it a dropdown to select which hash type you want. You could put the md5 in there, and if you get a file back, you have it! That's a bit of a clunky workflow if you want to test a lot of files, but I think it would be ok until we have something better in place.

I am eventually planning to have the client serve a lot of queries, including "here's a hash, what's the file status?" through its local http server so people with programming ability can write scripts to automate imports and tagging and so on. If you wanted, you could plug into this advanced system to do what you want quicker.

Anyway, for now, I'll make a note to extend the predicate, probably next week.

Just as an aside, in case you didn't know: Hydrus uses its sha256 to compare file contents just like you want to do, so that if you already have a file and try to import it again from another location, it will recognise that it already has the file. Feel free to import the same file several times to hydrus–it won't cause dupes (it usually says 'already in db' when this occurs). Furthermore, if you run a 4chan/8chan thread watcher, the md5 those sites provide in their API is automatically used to skip files you already have.




[Return][Go to top][Catalog][Post a Reply]
Delete Post [ ]
[]
[ home / board list / faq / random / create / bans / search / manage / irc ] [ ]