Selfhosted
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
Rules:
- Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
- No spam posting.
- Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
- Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
- Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
- No trolling.
Resources:
- selfh.st Newsletter and index of selfhosted software and apps
- awesome-selfhosted software
- awesome-sysadmin resources
- Self-Hosted Podcast from Jupiter Broadcasting
Any issues in the community? Report them using the report flag.
Questions? DM the mods!
Yeah, I found some stats now, and indeed you’re gonna wait something like an hour for processing if you throw 80-100k tokens into a powerful model. With APIs it works more or less instantly, which isn't surprising, but it gives a comparison. Bummer.
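For a sense of scale, the "about an hour for 80-100k tokens" figure above can be turned into an implied prompt-processing rate. This is just a back-of-the-envelope sketch; the function name is made up for illustration and the one-hour wait is taken from the comment, not measured.

```python
def implied_tokens_per_second(prompt_tokens: int, wait_seconds: float) -> float:
    """Prompt-processing rate implied by waiting `wait_seconds` for `prompt_tokens`."""
    if wait_seconds <= 0:
        raise ValueError("wait_seconds must be positive")
    return prompt_tokens / wait_seconds


if __name__ == "__main__":
    one_hour = 3600  # seconds, the wait quoted in the comment
    for tokens in (80_000, 100_000):
        rate = implied_tokens_per_second(tokens, one_hour)
        print(f"{tokens} tokens in 1 h -> {rate:.1f} tokens/s")
```

So an hour-long wait corresponds to roughly 22-28 prompt tokens per second.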
Anyways, the important thing is the "TOPS", aka trillions of operations per second. Having enough RAM is important, but if you don't have a fast processor then you're wasting RAM, since you could just stream the model from a fast SSD.
One such case is when your system can't handle more than 50 TOPS, like the Apple M systems. Try an old GPU and enjoy thousands of TOPS.
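To see how a TOPS number relates to prompt processing time, here is a rough sketch using the common rule of thumb that a transformer forward pass costs about 2 operations per parameter per token. Everything here is an assumption for illustration: the 70B parameter count is hypothetical, the rule ignores attention cost and memory bandwidth, and advertised TOPS are usually peak low-precision figures that real workloads don't reach (hence the `utilization` knob).

```python
def prefill_seconds(params: float, prompt_tokens: int, tops: float,
                    utilization: float = 1.0) -> float:
    """Estimate compute-bound prompt processing time.

    Assumes ~2 operations per parameter per token (rule of thumb),
    and that `tops` is trillions of operations per second.
    """
    if tops <= 0 or not (0 < utilization <= 1):
        raise ValueError("tops must be > 0 and utilization in (0, 1]")
    total_ops = 2.0 * params * prompt_tokens
    return total_ops / (tops * 1e12 * utilization)


if __name__ == "__main__":
    params = 70e9      # hypothetical 70B-parameter model
    tokens = 100_000   # prompt size from the thread
    for tops in (50, 1000):
        secs = prefill_seconds(params, tokens, tops)
        print(f"{tops} TOPS at full utilization: ~{secs:.0f} s")
```

Treat the result as a best case: at lower utilization, or when the model doesn't fit in fast memory, the real wait is longer.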
"Application Programming Interface": are you talking about something on the internet? On a GPU driver? On your phone?
Then also, what size model are you using? Defined with int32? fp4? Somewhere in between? That's where RAM requirements come in.
I get that you're trying to do a mic drop or something, but you're not being very clear.
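To put numbers on the int32 vs fp4 question: the memory needed just for the weights is roughly parameter count times bits per weight. A minimal sketch follows, with a hypothetical 70B-parameter model as the example; it ignores the KV cache, activations, and any quantization overhead, so actual usage is higher.

```python
# Bits per weight for a few storage formats mentioned in the thread or in between.
BITS_PER_WEIGHT = {"int32": 32, "fp16": 16, "int8": 8, "fp4": 4}


def weight_gb(params: float, fmt: str) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return params * BITS_PER_WEIGHT[fmt] / 8 / 1e9


if __name__ == "__main__":
    params = 70e9  # hypothetical model size, not from the thread
    for fmt in BITS_PER_WEIGHT:
        print(f"{fmt:>5}: ~{weight_gb(params, fmt):.0f} GB")
```

The same model differs by a factor of 8 in RAM between int32 and fp4, which is why the model size alone doesn't tell you what hardware you need.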