Author Topic: I kinda miss those 5000 search engines.  (Read 3690 times)

littleman

  • Administrator
  • Hero Member
  • *****
  • Posts: 6546
Re: I kinda miss those 5000 search engines.
« Reply #15 on: May 26, 2018, 12:29:15 AM »
Can you talk about the operating costs and/or business model for mojeek?

BoL

  • Inner Core
  • Hero Member
  • *
  • Posts: 1206
Re: I kinda miss those 5000 search engines.
« Reply #16 on: May 26, 2018, 07:00:48 AM »
I'm probably not in the best position to talk about it, but the owner might be interested in stopping by sometime.

Privacy and no 3rd-party tracking are cornerstones of all future plans though. He's pointed out some fundamental issues with the way DDG serve their results...

I'm sure there will be ads at some point, but nothing that would involve your browsing history.

aaron

  • Inner Core
  • Full Member
  • *
  • Posts: 229
Re: I kinda miss those 5000 search engines.
« Reply #17 on: May 26, 2018, 08:50:54 AM »
Quote
(Having studied their results, I think a bigger index and fresher results solves a lot of the relevancy issues, their ranking algo is pretty sophisticated)
Looked quite decent at a glance. Couple quick bits of feedback:
  • One aspect that is tough is some B2B bleed into B2C queries. As an example, Authorize.net is embedded in many sites, so it is ranking #1 for "credit cards". Some of the big search engines with lots of end-user clickstream data likely get around this problem by feeding the normalized click data back into rankings and using it to augment or validate the link graph.
  • Second thing I saw is lots of listings with no meta description or snippet.
  • Third, I'm not sure I love the brand name. Where did the name come from? Is it synonymous with something else?
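
To make the click-feedback idea in the first bullet concrete, here is a rough sketch. It is not how any particular engine does it. The expected click-through rates per position, the smoothing prior, and the blending formula are all made-up assumptions for illustration, and `consumer-cards.example` is a placeholder site.

```python
from dataclasses import dataclass

# Hypothetical expected click-through rate by result position, used to
# correct raw clicks for position bias. Values are invented for illustration.
EXPECTED_CTR = {1: 0.30, 2: 0.15, 3: 0.10, 4: 0.07, 5: 0.05}
DEFAULT_EXPECTED_CTR = 0.03  # assumed rate for positions below 5


@dataclass
class Result:
    url: str
    link_score: float  # score from the link graph, assumed to be in [0, 1]
    position: int      # position at which the result was shown
    impressions: int
    clicks: int


def normalized_click_score(r: Result, prior_weight: int = 50) -> float:
    """Observed CTR divided by the CTR expected at that position.

    The observed rate is smoothed toward the expected rate, so a result
    with no click data scores 1.0 (neutral).
    """
    expected = EXPECTED_CTR.get(r.position, DEFAULT_EXPECTED_CTR)
    observed = (r.clicks + prior_weight * expected) / (r.impressions + prior_weight)
    return observed / expected


def rerank(results: list[Result], click_weight: float = 0.5) -> list[Result]:
    """Sort by link score scaled by the normalized click ratio."""
    def blended(r: Result) -> float:
        return r.link_score * normalized_click_score(r) ** click_weight
    return sorted(results, key=blended, reverse=True)


# A heavily linked B2B site that few searchers click, versus a less
# linked consumer site that many searchers click.
results = [
    Result("authorize.net", 0.9, 1, 1000, 20),
    Result("consumer-cards.example", 0.6, 2, 1000, 300),
]
reranked = rerank(results)
```

With these invented numbers the consumer site moves above the B2B site, because its clicks run well ahead of what its position would predict while the B2B site's run well behind.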

Quote
(Having studied their results, I think a bigger index and fresher results solves a lot of the relevancy issues, their ranking algo is pretty sophisticated)
If they get enough scale to strike partnerships with some leading sites in popular categories that could help cover some such relevancy issues. Some of the bigger search engines might get worse on this front as more news publications erect paywalls.

Quote
The problem with the native Reddit engine is that it only uses its onsite data to display results, but combining that data with actual on page content would make for a great search engine.
If the signal were kept secret it could probably work well, but if they announced it & had significant search marketshare that signal would get hit hard quickly.

Mackin USA

  • Inner Core
  • Hero Member
  • *
  • Posts: 2905
  • Abstract Artist
Re: I kinda miss those 5000 search engines.
« Reply #18 on: May 26, 2018, 12:58:17 PM »
I read the title to this thread and thought "I MISS HIDDEN TEXT"

#Smile
Mr. Mackin

Brad

  • Inner Core
  • Hero Member
  • *
  • Posts: 4149
  • What, me worry?
Re: I kinda miss those 5000 search engines.
« Reply #19 on: May 26, 2018, 03:46:01 PM »
>mojeek.com

For a relatively small DB the results I got were pretty good.  Some of the warhorses were missing, but everyone else displays them because they are copying the Google SERPs as much as they can. mojeek is ranking for itself, which brings up interesting stuff that is just as relevant as the warhorses.

This all makes me wonder if good metasearch engines might make a comeback?  Part of what killed metasearch was that all the engines and directories died, so it was just Google and Bing and engines using Bing, which was not very interesting.  But with Gigablast (open source), mojeek, Yandex and others combined with Bing, maybe it's time to revisit the viability of metasearch.  (DDG is really just a blended metasearch anyway.)
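
For anyone wondering what the blending step of a metasearch engine might look like, here is a minimal sketch using reciprocal rank fusion, a common way to merge ranked lists. This is a technique swapped in for illustration, not a description of how DDG or searx actually merge results. The engine result lists are invented placeholders, and no real search APIs are called.

```python
from collections import defaultdict


def reciprocal_rank_fusion(ranked_lists, k=60):
    """Merge several ranked URL lists into one.

    Each URL earns 1 / (k + rank) from every list it appears in, so URLs
    that several engines rank highly rise to the top. k=60 is the value
    commonly used with this method.
    """
    scores = defaultdict(float)
    for results in ranked_lists:
        for rank, url in enumerate(results, start=1):
            scores[url] += 1.0 / (k + rank)
    # Highest fused score first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda url: (-scores[url], url))


# Placeholder result lists standing in for three independent engines.
engine_a = ["a.example", "b.example", "c.example"]
engine_b = ["b.example", "d.example", "a.example"]
engine_c = ["b.example", "a.example", "e.example"]

merged = reciprocal_rank_fusion([engine_a, engine_b, engine_c])
```

The appeal for metasearch is that only rank positions are needed, so engines with incomparable internal scores can still be blended.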

In this thread: http://th3core.com/talk/water-coolerextra/new-sighting-'duck-duck'-as-a-verb/  we talk about searx.me, an open source metasearch engine that is pretty good and hackable.  Something like that might be interesting.  I wish I had the knowledge to install and maintain the script, because I'd put it on a server, if for no other reason than to say I was back into search again.   8)  Or maybe bribe littleman to put a searx instance up on th3core so we don't get tracked by The Man.

littleman

  • Administrator
  • Hero Member
  • *****
  • Posts: 6546
Re: I kinda miss those 5000 search engines.
« Reply #20 on: May 26, 2018, 06:22:29 PM »
 "I MISS HIDDEN TEXT"

Have some nostalgia on me.

Brad

  • Inner Core
  • Hero Member
  • *
  • Posts: 4149
  • What, me worry?
Re: I kinda miss those 5000 search engines.
« Reply #21 on: May 26, 2018, 09:17:36 PM »
While we are talking, does anyone know anything about Wotbox.com?  It was a spidering engine; I wonder if they are still active.

Also Exalead?

littleman

  • Administrator
  • Hero Member
  • *****
  • Posts: 6546

Brad

  • Inner Core
  • Hero Member
  • *
  • Posts: 4149
  • What, me worry?
Re: I kinda miss those 5000 search engines.
« Reply #23 on: May 27, 2018, 11:09:35 AM »
It looks like they spidered for a while in 2012 and then quit.  Backfill-quality sort of results.  So I'll put it in the dead column.

Brad

  • Inner Core
  • Hero Member
  • *
  • Posts: 4149
  • What, me worry?
Re: I kinda miss those 5000 search engines.
« Reply #24 on: February 16, 2022, 08:01:59 PM »
>directories ... They had their day and it is largely over.

I think there may be room for a hybrid directory search engine based on popularity and categories from a site like Reddit.  The idea would be a spidering search engine that uses the sub-reddit data, popularity waves and 'upvote/downvote' scores to affect the ranking algorithm.  There is a lot going on there that could be applied to rank, stuff that DMOZ and sites like DirectHit used to try to do.

The problem with the native Reddit engine is that it only uses its onsite data to display results, but combining that data with actual on page content would make for a great search engine.
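
As a rough illustration of the idea in the quote, vote data could be folded into a text-relevance score along these lines. Everything here is an assumption made for the sketch: the Wilson lower bound is just one well-known way to turn up/down votes into a confidence-adjusted score, the 0.3 weight is arbitrary, and `text_relevance` stands in for whatever on-page score a spidering engine would compute.

```python
import math


def wilson_lower_bound(upvotes: int, downvotes: int, z: float = 1.96) -> float:
    """Lower bound of the Wilson score interval for the upvote ratio.

    Rewards items with many votes over items with the same ratio but
    few votes. z=1.96 corresponds to roughly 95% confidence.
    """
    n = upvotes + downvotes
    if n == 0:
        return 0.0
    p = upvotes / n
    centre = p + z * z / (2 * n)
    margin = z * math.sqrt((p * (1 - p) + z * z / (4 * n)) / n)
    return (centre - margin) / (1 + z * z / n)


def hybrid_score(text_relevance: float, upvotes: int, downvotes: int,
                 vote_weight: float = 0.3) -> float:
    """Blend on-page relevance (assumed in [0, 1]) with the vote signal."""
    votes = wilson_lower_bound(upvotes, downvotes)
    return (1 - vote_weight) * text_relevance + vote_weight * votes


# Two hypothetical pages with equal on-page relevance but different votes.
well_voted = hybrid_score(0.7, upvotes=90, downvotes=10)
thinly_voted = hybrid_score(0.7, upvotes=9, downvotes=1)
```

Both pages have a 90% upvote ratio, but the one with 100 votes scores higher than the one with 10, which is the kind of popularity signal the quote is getting at.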

Sorry to reactivate a really old thread but I remembered something Littleman said (quote above) about Reddit being useful for a hybrid directory search engine.

I read this today which reminded me of Littleman's quote:

https://boingboing.net/2022/02/16/google-is-dead-long-live-google-sitereddit-com.html

Link to original essay which is interesting: 

https://dkb.io/post/google-search-is-dying?utm_source=tldrnewsletter

The point being that for many things, like reviews, Reddit content is better than most of the packaged, ad-supported review sites Google thinks are good.  I know that's not exactly what Littleman was talking about, but it is interesting how Reddit keeps getting mentioned as a good, interesting source (RC!).

rcjordan

  • I'm consulting the authorities on the subject
  • Global Moderator
  • Hero Member
  • *****
  • Posts: 16315
  • Debbie says...
Re: I kinda miss those 5000 search engines.
« Reply #25 on: February 16, 2022, 08:56:34 PM »
For ongoing research on several topics from medical to news, reddit is absolutely my best resource.

Brad

  • Inner Core
  • Hero Member
  • *
  • Posts: 4149
  • What, me worry?
Re: I kinda miss those 5000 search engines.
« Reply #26 on: February 17, 2022, 01:57:11 PM »
I've not really explored Reddit to any depth, but there are a couple of subreddits I found and lurk in.  I'm beginning to see the value.   Reddit to me is a giant rabbit hole I used to avoid but I'm going to explore some more.

I never saw the need for up and down voting threads/replies but I now see the value.  At least on certain subreddits voting moves the trolls and shills down and out of the way.  It's not perfect but it helps.

rcjordan

  • I'm consulting the authorities on the subject
  • Global Moderator
  • Hero Member
  • *****
  • Posts: 16315
  • Debbie says...
Re: I kinda miss those 5000 search engines.
« Reply #27 on: February 17, 2022, 02:18:27 PM »
For your regularly visited /r's I suggest bookmarking the /new page for descending chron order.

But it can be a firehose.  Let me know if you want TM filters.

rcjordan

  • I'm consulting the authorities on the subject
  • Global Moderator
  • Hero Member
  • *****
  • Posts: 16315
  • Debbie says...
Re: I kinda miss those 5000 search engines.
« Reply #28 on: February 17, 2022, 02:22:40 PM »
Also, don't forget that you can revert to the "old" text version of subreddits. Handier for scanning.

For instance, 99% of my covid info comes from here (filtered for US only)

https://old.reddit.com/r/Coronavirus/new/

Drastic

  • Need a bigger hammer...
  • Global Moderator
  • Hero Member
  • *****
  • Posts: 3084
  • Resident Redneck
Re: I kinda miss those 5000 search engines.
« Reply #29 on: February 18, 2022, 01:45:29 PM »
>Also, don't forget that you can revert to the "old" text version of subreddits. Handier for scanning.
Is new even usable for you? I cannot stand it as it's completely shite for me.

Also recommended is RES (Reddit Enhancement Suite). Lots of handy shortcuts and tools as a browser add-on.