@mierdabird

mierdabird@lemmy.dbzer0.com · 7 hours ago

As for the tiny/mini/micro PCs, I think an i5-6500T, 7500T, or 8500T (the T signifies a 35W TDP) could all fit your price point, depending on RAM/SSD specs. I haven’t done much research on the N100 processors, but I think they are broadly comparable to the i5s above.

mierdabird@lemmy.dbzer0.com · edit-2 7 hours ago

While I get leaning towards AMD products (I’ve been doing so as well), when I built my first server with a Ryzen 5 2400GE I found that there just aren’t as many resources or as much support for enabling transcoding with the Vega 11 in Jellyfin or Immich. Most Intel iGPUs have dedicated hardware specifically tuned for transcoding, called Quick Sync, that you should strongly consider.

Especially in the $100-200 price range, tiny/mini/micro PCs from HP/Lenovo/Dell are widely available and offer lots of capability in a power-efficient (~10-15W idle, 40-50W full load) and easily maintainable form factor. The Lenovos in particular are interesting because a few models have full PCIe slots, in case you decide later that you want a GPU.
Lenovo pci-e
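
To put the idle figures in perspective, here is a quick sketch of the yearly energy they imply. It assumes the box runs 24/7 at a constant draw, which is an assumption rather than a measurement, and it leaves electricity price out since that varies by region.

```python
HOURS_PER_YEAR = 24 * 365  # assumes the machine runs 24/7, non-leap year


def annual_kwh(watts: float) -> float:
    """Convert a constant power draw in watts to kilowatt-hours per year."""
    return watts * HOURS_PER_YEAR / 1000


# Idle range quoted above for tiny/mini/micro PCs.
for watts in (10, 15):
    print(f"{watts} W idle -> {annual_kwh(watts):.1f} kWh/year")
```

That works out to roughly 88 to 131 kWh per year at idle; multiply by your own rate per kWh to get a cost.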

Finally, for software I would suggest looking into Cosmos Cloud. I use it and have found it makes it so much easier to set up and manage all my Docker containers and domain name/reverse proxy settings.

mierdabird@lemmy.dbzer0.com · 8 hours ago

Any Intel CPU with Quick Sync will likely have plenty of transcoding capability for his use case, with significantly lower power draw.

mierdabird@lemmy.dbzer0.com · 6 days ago

Even trucks like these don’t need a giant hood like that. It’s a design choice to look tough, and it could really be regulated away.

mierdabird@lemmy.dbzer0.com · 13 days ago

Ya can never be too careful, you might be inside my network at this very moment and hiding the internal IP is my last line of defense! 😆

mierdabird@lemmy.dbzer0.com · 13 days ago

Not sure who downvoted lol but here’s proof

mierdabird@lemmy.dbzer0.com · 13 days ago

Your Roku should be trying to connect to the internal IP:8096 (the Jellyfin port) of your Arch device, not whatever your Tailscale address is. I don’t personally use Tailscale, so if your setup blocks local access then you may need to solve that first.
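
If you want to rule out a network problem first, a minimal sketch like this checks from another machine on the LAN whether anything is accepting connections on the Jellyfin port. The function name is hypothetical, and you would pass in your own server's internal address; 8096 is the port mentioned above.

```python
import socket


def port_is_open(host: str, port: int = 8096, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        # create_connection resolves the host and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

If this returns False from a device on the same network, the Roku will not be able to reach the server either, and the problem is local access rather than the Jellyfin app.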

mierdabird@lemmy.dbzer0.com · edit-2 15 days ago

If this is your first time trying to selfhost, I highly recommend Cosmos Cloud. I’ve been using it for 6 months and it’s made every step of the way so much easier for me. It manages Docker containers and includes reverse proxy and security features, with a paid option for a personal VPN like Tailscale.

Most services work perfectly from a catalog of pre-built Docker Compose files, but for Jellyfin I remember I did have to go to the internal Docker IP on the actual host machine to set the server up and get it working properly so it was visible from other machines.

mierdabird@lemmy.dbzer0.com · 17 days ago

Bro 2.0 came so fast I didn’t even have time to do 144, like why did they even bother releasing that when 2.0 was coming the very next day lol

mierdabird@lemmy.dbzer0.com · edit-2 19 days ago

Like you ended up doing a Pi-hole at home? I’m surprised there’s no access control. I was on the verge of setting that or AdGuard Home up for myself but realized using AdGuard’s public servers is effectively the same thing, just without the extra privacy of hosting at home.

mierdabird@lemmy.dbzer0.com · 19 days ago

I’ve run the DuckDuckGo version of this for years but only recently found out you can get most of this functionality natively in Android (Android 13 for me) by setting a private DNS as shown in the below image. My DuckDuckGo app tracking protection does still catch attempts, but it’s basically just Google now, instead of the dozens of companies before.

mierdabird@lemmy.dbzer0.com · 20 days ago

I’m surprised you’re getting disappointing results with Qwen 3 Coder 480b. I run Qwen 2.5 Coder 14b locally (Open WebUI + Ollama) on my 3060 12GB and I’ve been pretty pleased with its answers so far relating to Python code, Django documentation/settings, and quirks with my reverse proxy.

I assume you aren’t hosting the 480b locally, right? Are you using Open WebUI and an Open API key?

mierdabird@lemmy.dbzer0.com · 1 month ago

I initially installed Ollama/OpenWebUI on my HP G4 Mini, but it’s got no GPU obviously, so with 16GB RAM I could run 7b models but only at 2 or 3 tokens/sec.
It definitely made me regret not buying a bigger case that could accommodate a GPU, but I ended up installing the same Ollama/OpenWebUI pair on my Windows desktop with a 3060 12GB and it runs great - 14b models at 15+ tokens/sec.
Even better, I figured out that my reverse proxy on the server is capable of redirecting to other addresses in my network, so now I just have a dedicated subdomain URL for my desktop instance. Its OpenWebUI is now just as accessible remotely as my server’s.

mierdabird@lemmy.dbzer0.com · 1 month ago

When on your wifi, try navigating in your browser to your Windows computer’s address with a colon and the port 11434 at the end. It would look something like this:

http://192.168.xx.xx:11434/

If it works your browser will just load the text: Ollama is running

From there you just need to figure out how you want to interact with it. I personally pair it with OpenWebUI for the web interface
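
The same check can be scripted. This is a minimal sketch using only the Python standard library; the function names are hypothetical, and it relies only on the two details above (port 11434 and the "Ollama is running" text).

```python
import urllib.request


def ollama_url(host: str, port: int = 11434) -> str:
    """Build the base URL for an Ollama server on the local network."""
    return f"http://{host}:{port}/"


def ollama_is_running(host: str, port: int = 11434, timeout: float = 3.0) -> bool:
    """Return True if the server replies with the 'Ollama is running' banner."""
    try:
        with urllib.request.urlopen(ollama_url(host, port), timeout=timeout) as resp:
            body = resp.read().decode("utf-8", errors="replace")
    except OSError:
        # URLError and socket timeouts are both subclasses of OSError.
        return False
    return "Ollama is running" in body
```

Call `ollama_is_running(...)` with your Windows machine's LAN address. If it returns False from another device but the page loads on the Windows machine itself, the server is probably only listening locally or a firewall is blocking the port.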

mierdabird@lemmy.dbzer0.com · 1 month ago

Tesla was the first to make it popular in modern vehicles iirc

mierdabird@lemmy.dbzer0.com · 1 month ago

Not really sure I understand how these work. Do you just feed it a large textual document like a transcript or something, and it turns it into a more machine-readable vector format or something?

Or is it just a much smaller LLM that’s more optimized for reading than generating?

mierdabird@lemmy.dbzer0.com · 2 months ago

The update is giving me a performance uplift on my 3060 that’s WAY more than 7%. Using qwen2.5-coder:14b-instruct-q5_K_M, here’s the exact same prompt rerun before and after:

mierdabird@lemmy.dbzer0.com · 2 months ago

So I googled it, and if you have a Pi 5 with 8GB or 16GB of RAM it is technically possible to run Ollama, but the speeds will be excruciatingly slow. My Nvidia 3060 12GB will typically run 14b (billion parameter) models at around 11 tokens per second. This website shows a Pi 5 only runs an 8b model at 2 tokens per second - each query will literally take 5-10 minutes at that rate:
Pi 5 Deepseek
It also shows you can get a reasonable pace out of the 1.5b model but those are whittled down so much I don’t believe they’re really useful.
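
For a rough sense of where the 5-10 minute figure comes from, here is the arithmetic as a small sketch. The answer lengths of 600 and 1200 tokens are assumptions (they are simply what 5-10 minutes at 2 tokens per second implies); the two speeds are the ones quoted above.

```python
def minutes_per_answer(answer_tokens: int, tokens_per_second: float) -> float:
    """Time to generate an answer of the given length, in minutes."""
    return answer_tokens / tokens_per_second / 60


# Assumed answer lengths; real responses vary a lot by prompt and model.
for tokens in (600, 1200):
    pi5 = minutes_per_answer(tokens, 2.0)    # Pi 5 running an 8b model
    gpu = minutes_per_answer(tokens, 11.0)   # 3060 12GB running a 14b model
    print(f"{tokens} tokens: Pi 5 {pi5:.1f} min, 3060 {gpu:.1f} min")
```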

There are lots of lighter-weight services you can host on a Pi though. I highly recommend an app called Cosmos Cloud; it’s really an all-in-one solution for building your own self-hosted services. It has its own reverse proxy (like Nginx or Traefik) that includes Let’s Encrypt security certificates, URL management, and incoming traffic security features. It has an excellent UI for managing Docker containers and a large catalog of prepared Docker Compose files to spin up services with the click of a button. It also has more advanced features you can grow into, like an OpenID SSO manager, your own VPN, and disk management/backups.
It’s still very important to read the documentation thoroughly and expect occasional troubleshooting will be necessary, but I found it far, far easier to get working than a previous Nginx/Docker/Portainer setup I used.

mierdabird@lemmy.dbzer0.com · 2 months ago

Using Ollama depends a lot on the equipment you run - you should aim to have at least 12GB of VRAM/unified memory to run models. I have one copy running in a Docker container using the CPU on Linux and another running on the GPU of my Windows desktop, so I can give install advice for either OS if you’d like.
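
A rough way to see why 12GB is a sensible floor is to estimate the size of the quantized weights. This is a back-of-the-envelope sketch, not how Ollama itself decides anything: the 5.5 bits per weight (roughly a 5-bit quantization) and the 1.5GB of headroom for context and runtime overhead are both assumed figures.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         vram_gb: float, headroom_gb: float = 1.5) -> bool:
    """True if weights plus an assumed headroom fit in the given memory."""
    return weight_memory_gb(params_billion, bits_per_weight) + headroom_gb <= vram_gb


# Assumed ~5.5 bits per weight for a 5-bit quantized model.
for size in (7, 14, 32):
    gb = weight_memory_gb(size, 5.5)
    print(f"{size}b: ~{gb:.1f} GB weights, fits in 12 GB: {fits(size, 5.5, 12)}")
```

Under these assumptions a 14b model lands just under 10GB of weights, which is why it squeezes onto a 12GB card while a 32b model does not.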

mierdabird@lemmy.dbzer0.com · edit-2 2 months ago

I’m actually right there with you. I have a 3060 12GB and tbh I think it’s the absolute most cost-effective GPU option for home use right now. You can run 14B models at a very reasonable pace.
Doubling or tripling the cost and power draw just to get 16-24GB doesn’t seem worth it to me. If you really want an AI-optimized box I think something with the new Ryzen AI Max chips would be the way to go - like an ASUS ROG Flow Z13, Framework Desktop or the GMKtec option, whatever it’s called. Apple’s new Mac Minis are also great options. Both Ryzen AI Max and Apple make use of shared CPU/GPU memory so you can go up to 96GB+ at much, much lower power draws.