this post was submitted on 13 Aug 2023
1 points (100.0% liked)

Technology

37717 readers
408 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
top 10 comments
sorted by: hot top controversial new old
[–] [email protected] 1 points 1 year ago (2 children)

I'd bet sites blocking ChatGPT will regret it when (not if) Bing starts using it for search engine relevance.

[–] [email protected] 1 points 1 year ago

That’s because you block the GPT crawler doesn’t mean you are no longer indexed

[–] [email protected] 0 points 1 year ago (1 children)

Serious question — you think any amount of AI will make people use Bing? 🤔

[–] [email protected] 0 points 1 year ago* (last edited 1 year ago) (1 children)

I started using it this year because its actually been giving me decent results unlike google. We're...in a dark timeline

[–] [email protected] 0 points 1 year ago* (last edited 1 year ago) (1 children)

Hmm. I have an incredibly strong aversion to everything Microsoft, so even giving Bing a chance is difficult. However, I must admit that I can recognize the part about Google not delivering. I even went so far as to tamper with the CSS recently just to make Google's results slightly easier to parse.

Maybe it's time to try something new 🤔 I just wish the only viable alternative wasn't made by Microsoft 😓🤢

Dark timeline, indeed! 😔

[–] [email protected] 1 points 11 months ago

Check out Kagi, paid search is extremely worth it. Stop being a product to sell and start being a customer.

[–] [email protected] 1 points 1 year ago (1 children)

But for large website operators, the choice to block large language model (LLM) crawlers isn't as easy as it may seem. Making some LLMs blind to certain website data will leave gaps of knowledge that could serve some sites very well (such as sites that don't want to lose visitors if ChatGPT supplies their information for them), but it may also hurt others. For example, blocking content from future AI models could decrease a site's or a brand's cultural footprint if AI chatbots become a primary user interface in the future. As a thought experiment, imagine an online business declaring that it didn't want its website indexed by Google in the year 2002—a self-defeating move when that was the most popular on-ramp for finding information online.

Really curious how this will end up

[–] [email protected] 1 points 1 year ago

That's an interesting point that I hadn't considered, the comparison to Google indexing in the early 2000's may prove to be very apt with the number of people I've seen using chat GPT as a search engine.

[–] [email protected] 0 points 1 year ago (1 children)

Lemmy.ca added a block at the nginx level for it

https://lemmy.ca/comment/1999439

# curl -H 'User-agent: GPTBot' https://lemmy.ca/ -i
HTTP/2 403
[–] [email protected] 1 points 1 year ago

Hilariously, unless ALL lemmy instances do this, anyone that federates with you will have to block it too or any communities they sync with you will be available on their instances...