Imdex is an OCR-indexed image search engine that makes the text within images searchable.
Imdex lets you search for images based on the text found inside them. It runs a crawler that indexes images found online by passing each one through a set of image transformations and then through Tesseract, an open source OCR engine.
Click any of the three links below to see info/stats. Start typing anywhere on the page and press enter to run a search. Words of four or more letters work best because of inaccuracies in Tesseract.
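The indexing idea above can be illustrated with a minimal sketch. This is not Imdex's actual code: it assumes the OCR text has already been extracted (the Tesseract step is skipped), it uses an in-memory map in place of a real search backend, and the function names `tokenize`, `indexImage`, and `search` and the example URLs are made up for illustration.

```javascript
// Hypothetical sketch: an in-memory inverted index over OCR output.
// The OCR step is assumed to have run already; only its text is used here.

// Split raw OCR text into lowercase alphanumeric tokens.
function tokenize(ocrText) {
  return ocrText.toLowerCase().split(/[^a-z0-9]+/).filter(Boolean);
}

// Add one image's OCR text to the index (Map: word -> Set of image URLs).
function indexImage(index, imageUrl, ocrText) {
  for (const word of tokenize(ocrText)) {
    if (!index.has(word)) index.set(word, new Set());
    index.get(word).add(imageUrl);
  }
  return index;
}

// Return the URLs of images whose text contains every query word.
function search(index, query) {
  const words = tokenize(query);
  if (words.length === 0) return [];
  const sets = words.map((w) => index.get(w) || new Set());
  return [...sets[0]].filter((url) => sets.every((s) => s.has(url)));
}

// Example data (made-up URLs and text).
const index = new Map();
indexImage(index, "http://example.com/a.png", "Follow us on Twitter!");
indexImage(index, "http://example.com/b.png", "Twitter and Facebook");
console.log(search(index, "twitter facebook"));
```

This sketch only matches exact words, so a single OCR misread would hide an image from the results.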
Tesseract, mongodb, jsfeat, mongoose, node canvas, async, express, jquery, elasticsearch, mustache
Voting is now closed.
A Medicore Corporation
Uva Wellassa University of Sri Lanka
This is completely untrue. I can show you any file data or proof you need that I didn't start working on this until NKO began.
I misheard the intro. Re-scored.
Well, unlike Google, I unfortunately don't have a warehouse of dedicated crawling servers at hand.
If imdex were to actually become something widely used, I'm sure I could collect funds to expand my computing power. 8 million+ images is not bad for just one crawling server though!
8 million!? I somehow got the idea that it was only 100k or so. Well done. Vote updated :)
Also, hello fellow Cambridger
Ah, yes. If you click stats on the home page, you can view the current index count (with a lag time of ~10 seconds).
And hey! Glad to see another Cambridge competitor here. Best of luck to you in NKO.
Do you have any constructive criticism for me? Did something not work?
Ah, ok, that makes sense; I've actually noticed that 3 letter words aren't working too well because fuzzy search is enabled. Larger words seem to be significantly more accurate though.
Thank you for the feedback!
Big-Oh Studios, Inc.
Sorry your searches didn't turn up correctly :(
Tesseract often produces one-letter typos (e.g. twatter instead of twitter), so I am using a fuzzy search algorithm. Because of this, imdex seems to work a bit better on searches with longer words.
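To show why fuzzy matching favors longer words, here is a rough sketch using plain Levenshtein edit distance with a tolerance of one edit. This is an assumption for illustration only: the comments do not say which fuzzy algorithm or tolerance imdex uses, and the helper names `editDistance` and `fuzzyMatches` and the word list are hypothetical.

```javascript
// Hypothetical sketch: fuzzy matching with a one-edit tolerance.

// Levenshtein edit distance, computed with a rolling one-row table.
function editDistance(a, b) {
  let prev = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr[j] = Math.min(
        prev[j] + 1,        // deletion
        curr[j - 1] + 1,    // insertion
        prev[j - 1] + cost  // substitution (or match)
      );
    }
    prev = curr;
  }
  return prev[b.length];
}

// Keep the words within maxEdits of the query (assumed tolerance: 1).
function fuzzyMatches(query, words, maxEdits = 1) {
  return words.filter((w) => editDistance(query, w) <= maxEdits);
}

// Made-up sample of words as they might come out of OCR.
const ocrWords = ["twatter", "cat", "car", "can", "cut", "bat"];
console.log(fuzzyMatches("twitter", ocrWords)); // [ 'twatter' ]
console.log(fuzzyMatches("cat", ocrWords));     // [ 'cat', 'car', 'can', 'cut', 'bat' ]
```

With one edit allowed, a seven-letter query recovers the misread word and nothing else, while a three-letter query also matches several unrelated words. That is consistent with the weaker results reported for short searches.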