aindilis_ has quit [Quit: ZNC 1.8.2+deb3.1+deb12u1 - https://znc.in]
aindilis has joined #ruby
jasfloss has joined #ruby
CRISPR has quit [Ping timeout: 245 seconds]
Jado has quit [Ping timeout: 252 seconds]
Jado has joined #ruby
Jado has quit [Ping timeout: 268 seconds]
CRISPR has joined #ruby
eax_ has left #ruby [#ruby]
ih8u has quit [Quit: ih8u]
wbooze has quit [Read error: Connection reset by peer]
Inline has quit [Ping timeout: 268 seconds]
Jado has joined #ruby
dionysus69 has quit [Quit: dionysus69]
Jado has quit [Ping timeout: 244 seconds]
Jado has joined #ruby
Jado has quit [Ping timeout: 268 seconds]
CRISPR has quit [Quit: WeeChat 3.8]
Jado has joined #ruby
Jado has quit [Ping timeout: 265 seconds]
Jado has joined #ruby
xokia_ has quit [Ping timeout: 246 seconds]
patrick has quit [Ping timeout: 248 seconds]
patrick_ is now known as patrick
grenierm has joined #ruby
patrick_ has joined #ruby
patrick has joined #ruby
patrick has quit [Changing host]
patrick_ is now known as patrick
patrick_ has joined #ruby
Jado has quit [Ping timeout: 248 seconds]
konsolebox has joined #ruby
Jado has joined #ruby
dalan03822833508 has quit [Quit: dalan03822833508]
Jado has quit [Ping timeout: 265 seconds]
dalan03822833508 has joined #ruby
rdsm has joined #ruby
xokia_ has joined #ruby
konsolebox has quit [Ping timeout: 260 seconds]
xokia_ has quit [Read error: Connection reset by peer]
fantazo has joined #ruby
Jado has joined #ruby
xokia_ has joined #ruby
xokia_ has quit [Read error: Connection reset by peer]
Jado has quit [Ping timeout: 244 seconds]
Jado has joined #ruby
konsolebox has joined #ruby
nirvdrum7 has quit [Ping timeout: 252 seconds]
Jado has quit [Ping timeout: 248 seconds]
Jado has joined #ruby
rvalue has quit [Read error: Connection reset by peer]
rvalue has joined #ruby
hwpplayer1 has joined #ruby
gemmaro has quit [Read error: Connection reset by peer]
Jado has quit [Ping timeout: 252 seconds]
konsolebox has quit [Ping timeout: 248 seconds]
Jado has joined #ruby
Stenotrophomonas is now known as brokkoli_origin
hwpplayer1 has quit [Remote host closed the connection]
Linux_Kerio has joined #ruby
Tempesta has quit [Quit: See ya!]
gemmaro has joined #ruby
Jado has quit [Ping timeout: 252 seconds]
hwpplayer1 has joined #ruby
gemmaro has quit [Client Quit]
gemmaro_ has joined #ruby
Jado has joined #ruby
konsolebox has joined #ruby
Jado has quit [Ping timeout: 246 seconds]
hwpplayer1 has quit [Remote host closed the connection]
gemmaro_ has quit [Ping timeout: 252 seconds]
Jado has joined #ruby
Tempesta has joined #ruby
hwpplayer1 has joined #ruby
hwpplayer1 has quit [Remote host closed the connection]
hwpplayer1 has joined #ruby
gemmaro has joined #ruby
Jado has quit [Ping timeout: 248 seconds]
hwpplayer1 has quit [Read error: Connection reset by peer]
hwpplayer1 has joined #ruby
<o0x1eef>
havenwood: Right. But on huggingface.co you have a lot of different types of models. AFAIK llama is specialized towards LLMs specifically. On huggingface.co you could use a model at a lower level of abstraction, more like a programmer's API and expose that over HTTP.
<o0x1eef>
For example there's models specifically for text-to-speech, and so on.
<o0x1eef>
Usually you'd want a decent GPU though.
Jado has joined #ruby
hwpplayer1 has quit [Remote host closed the connection]
grenierm has quit [Ping timeout: 240 seconds]
Jado has quit [Ping timeout: 265 seconds]
brokkoli_origin has quit [Ping timeout: 265 seconds]
hwpplayer1 has joined #ruby
brokkoli_origin has joined #ruby
brokkoli_origin has quit [Remote host closed the connection]
brokkoli_origin has joined #ruby
xokia_ has joined #ruby
Jado has joined #ruby
konsolebox has quit [Ping timeout: 276 seconds]
xokia_ has quit [Read error: Connection reset by peer]
Jado has quit [Ping timeout: 245 seconds]
konsolebox has joined #ruby
konsolebox has quit [Ping timeout: 268 seconds]
Jado has joined #ruby
KoUmas has joined #ruby
konsolebox has joined #ruby
Jado has quit [Ping timeout: 244 seconds]
Jado has joined #ruby
Jado has quit [Ping timeout: 246 seconds]
xokia_ has joined #ruby
Jado has joined #ruby
hwpplayer1 has quit [Remote host closed the connection]
xokia_ has quit [Read error: Connection reset by peer]
dviola has joined #ruby
Jado has quit [Ping timeout: 260 seconds]
Jado has joined #ruby
user71 has joined #ruby
Jado has quit [Ping timeout: 252 seconds]
xokia has joined #ruby
Starfoxxes has joined #ruby
Jado has joined #ruby
wbooze has joined #ruby
Inline has joined #ruby
hwpplayer1 has joined #ruby
Jado has quit [Ping timeout: 252 seconds]
hwpplayer1 has quit [Ping timeout: 252 seconds]
hwpplayer1 has joined #ruby
Jado has joined #ruby
rvalue- has joined #ruby
rvalue has quit [Ping timeout: 252 seconds]
Jado has quit [Ping timeout: 246 seconds]
Jado has joined #ruby
rvalue- is now known as rvalue
TomyLobo has joined #ruby
r3m has quit [Quit: WeeChat 4.6.0-dev]
r3m has joined #ruby
Jado has quit [Ping timeout: 268 seconds]
sweeTarts is now known as swee
Jado has joined #ruby
<havenwood>
o0x1eef: As a tool, llama.cpp does support some multimodal models and it ships with a REST API server and the ability to download huggingface models directly as GGUF.
<havenwood>
At least it supports multiple VL models. It lacks image and video generation support generally, AFAIK. An advantage is that it's flexible to run on CPU or GPU and is easy to use. There are other inference runtimes that are better at running across multiple machines or specializing for certain hardware, but llama.cpp is a fine one to quickly get working
<havenwood>
with Ruby via REST.
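The "Ruby via REST" part is easy to sketch with the standard library alone. A minimal, hedged example: it assumes you've started `llama-server` yourself (e.g. `llama-server -m model.gguf`) and that it's listening on its default localhost:8080 with the OpenAI-compatible API; the model name is a placeholder.

```ruby
require "json"
require "net/http"

# Build the JSON body for an OpenAI-style chat completion request.
def chat_request_body(prompt)
  JSON.generate(
    model: "local", # llama-server serves whatever model it was started with
    messages: [{ role: "user", content: prompt }]
  )
end

# Only perform the HTTP call when a server URL is actually provided.
if (base = ENV["LLAMA_SERVER_URL"])
  uri = URI("#{base}/v1/chat/completions")
  res = Net::HTTP.post(uri, chat_request_body("Say hi from Ruby"),
                       "Content-Type" => "application/json")
  puts JSON.parse(res.body).dig("choices", 0, "message", "content")
end
```

Run with `LLAMA_SERVER_URL=http://localhost:8080` to hit a live server.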
<wbooze>
anybody using rvm ?
<havenwood>
Just for exploration I think it's fine. :) I'd use a separate tool for stable diffusion or flux.1 or whatever.
<havenwood>
wbooze: As one of the lingering RVM maintainers who uses chruby, I'd not recommend RVM unless you have a compelling reason.
<havenwood>
wbooze: Running into RVM issues or just considering it?
<wbooze>
how am i supposed to install iruby? i did a `gem install iruby`, but the iruby console complains about not finding bundler
<havenwood>
wbooze: What operating system?
<wbooze>
SUSE
<havenwood>
The modern choices are chruby/ruby-install, rbenv/ruby-build, asdf or mise.
<havenwood>
Both asdf and mise install languages other than Ruby.
<wbooze>
bundle install --gemfile=<path> works, but unless i cd to that dir, the iruby console complains about not being able to find bundler
<havenwood>
The simplest thing that can possibly work beyond installing yourself and setting env vars is chruby/ruby-install.
<wbooze>
afterwards, when i `iruby register` from there, the kernel is unable to find bundler either, and it all doesn't work
<wbooze>
so it all works when i cd to that dir, at least for the console; but even when registering a kernel from there, the kernel does not run correctly
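One possible explanation for the directory-dependence described here (an assumption, not something confirmed in the channel): Bundler locates the Gemfile relative to the current directory unless told otherwise via the BUNDLE_GEMFILE environment variable, which a registered kernel won't inherit from your shell. A sketch, with a made-up path:

```ruby
# Bundler honors BUNDLE_GEMFILE, so a process started outside the project
# directory can still find the right Gemfile. This path is hypothetical;
# substitute your project's actual Gemfile.
ENV["BUNDLE_GEMFILE"] = File.expand_path("my_project/Gemfile", Dir.pwd)

require "bundler"
puts Bundler.default_gemfile # the Gemfile Bundler will now resolve to
# require "bundler/setup"    # would then activate that Gemfile's gems
```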
cappy has joined #ruby
<havenwood>
wbooze: Check `which ruby` and `ruby -v` from both directories. Is it the same?
<wbooze>
i only have 1 ruby
<havenwood>
wbooze: Then check `which bundle` and `gem which bundler`?
<havenwood>
wbooze: Modern Ruby ships with Bundler, so `bundle` being missing makes me suspect you're using an old Ruby. What version of Ruby?
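The checks being suggested can also be done from inside Ruby itself; run the same snippet from both directories and compare the output:

```ruby
require "rbconfig"

# Which interpreter and Bundler does this shell actually resolve to?
puts RUBY_VERSION              # e.g. "3.4.1"
puts RbConfig.ruby             # absolute path of the running ruby binary

require "bundler"
puts Bundler::VERSION          # the Bundler that `require` found
spec = Gem.loaded_specs["bundler"]
puts spec.full_gem_path if spec # where that Bundler lives on disk
```

If the two directories print different paths, something (like RVM) is switching Rubies under you.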
<wbooze>
.rvm/gems/ruby-3.4.1/bin/bundle and bundler
<havenwood>
wbooze: Oh, RVM. If you're `cd`ing into a directory it may be autoswitching your Ruby.
<havenwood>
Sanity check `rvm list default` and `rvm list`. Does the directory you're cding into have a `.ruby-version` or `Gemfile` file? If so, what version of Ruby do they specify?
joako has quit [Quit: quit]
<wbooze>
it only has a Gemfile no .ruby-version
<wbooze>
both rvm list default and rvm list list only 1 thing, because i currently have only 1 ruby
Jado has quit [Ping timeout: 252 seconds]
<havenwood>
wbooze: Does the `Gemfile` have a `ruby` directive? Like: ruby "3.2.7"
<o0x1eef>
havenwood: I still think if I was to develop a product, I'd go with a Python web service that exposed whatever model you may want to use underneath. That seems like a much better *developer* environment. I don't really see llama in the same light.
<havenwood>
o0x1eef: Hmm, I think of "Llama" as the Meta models, like Llama 3.3 70b, and llama.cpp as needing the ".cpp" part to mean the runtime.
<havenwood>
o0x1eef: Yeah, fair that using an HTTP interface to stream JSON isn't what you'd want to do in prod. Handy for prototyping.
Exa has quit [Ping timeout: 244 seconds]
<o0x1eef>
I wouldn't bother with llama.cpp if I was developing a product or piece of software. It's too limiting.
<havenwood>
o0x1eef: For exploring models it's handy as a hand wave, I think. I've tried alternatives like Rust MLX C bindings but I end up spending a lot of time getting one model working where with llama.cpp or an MLX client I can try out a bunch quickly and figure out tooling before I commit.
<havenwood>
o0x1eef: Fair that Python makes it generally easier to get bindings unless you want to go to C ones.
<havenwood>
o0x1eef: Or with MLX they ship Swift ones too.
<havenwood>
I guess it depends on what you're doing.
<havenwood>
o0x1eef: I kinda agree the overhead and complexity is unacceptable beyond prototyping. I still think GGUF and MLX wrappers that just make it easy have use for prototyping.
<o0x1eef>
I mean, if you want to build software, then the best path IMO is to use Python, and interface with models that way. You can continue to train them, you can expose a light web interface, etc. You have complete control. You can deploy at scale. llama.cpp falls down for anything that isn't just a hobby project IMO.
<havenwood>
Kinda meant to be simple and run anywhere more than the best choice for production.
<o0x1eef>
And if you know Ruby, Python is not that big of a jump. You can pick it up quickly.
Exa has joined #ruby
<havenwood>
Yeah, modern Python smooths over many of the issues I used to have with it compared to Ruby.
<havenwood>
You can even `exit` the REPL without it refusing while knowing what you mean!
<havenwood>
o0x1eef: But then if you're doing a GUI, suddenly Python is sus and you probably want something compiled.
<o0x1eef>
As a Rubyist I think it is a nice alternative approach. You can still have your web stack in Ruby / Rails, and then the AI part is just another web service that happens to be implemented in a different language.
<havenwood>
You'd expect the venture-backed tools to use Rust or Zig, but they tend to use Electron with JavaScript and thinly wrap a REST API, deferring the runtime to an upstream tool.
Jado has joined #ruby
<havenwood>
o0x1eef: I think you could argue for llama.cpp for a "runs on one machine" app meant to be portable. As soon as you're running some LLM SaaS it's a terrible choice.
<havenwood>
vLLM or whatever is much better, for an off-the-shelf tool.
<havenwood>
Sometimes the foot-in-the-door tools have a place, even if it's just a stepping stone you leave behind.
<o0x1eef>
I'm more so thinking of a model for text to speech, text to image, or image to video. These are all models that could fit nicely behind a web service, and you could certainly build a product on them, with the main web application in Rails.
joako has joined #ruby
<havenwood>
o0x1eef: Yeah, totally. Makes me think of Open WebUI. Again, they just thinly wrap llama.cpp but. 🤷
<havenwood>
Qwen's chat just uses Open WebUI straight up. I dunno if they changed the backend, but if not it's an example at some scale. https://chat.qwenlm.ai
Jado has quit [Ping timeout: 246 seconds]
<o0x1eef>
It's cool indeed and a great option for self-hosting. I could see how it would be cool to set that up to be used within a LAN.
<o0x1eef>
I think it's a different usecase to what I'm getting at though.
Jado has joined #ruby
Linux_Kerio has quit [Ping timeout: 252 seconds]
<o0x1eef>
I will give you a more concrete example. I am working on a Rails application for taking bookmarks. I want an AI service that will help me with classification of those bookmarks. I don't need an LLM. I just need a model that's good at classification. I can pull one from huggingface.co, expose it over HTTP, and then it is a service my Rails app can use. The web service & interface with the model happens
<o0x1eef>
in Python.
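The Rails side of that could look something like this sketch. Everything here is hypothetical: the service URL, the `/classify` endpoint, and the `{"labels": [...]}` response shape are invented for illustration, not a real API.

```ruby
require "json"
require "net/http"

# Thin client for a hypothetical classification microservice (the Python
# side described above). Endpoint and response shape are assumptions.
class BookmarkClassifier
  def initialize(base_url = ENV.fetch("CLASSIFIER_URL", "http://localhost:8000"))
    @base_url = base_url
  end

  # POST the bookmark's title and URL; return the service's labels.
  def classify(title:, url:)
    uri = URI("#{@base_url}/classify")
    res = Net::HTTP.post(uri, JSON.generate(title: title, url: url),
                         "Content-Type" => "application/json")
    parse_labels(res.body)
  end

  # Split out so response handling is testable without a live service.
  def parse_labels(body)
    JSON.parse(body).fetch("labels", [])
  end
end
```

From the Rails app this is just `BookmarkClassifier.new.classify(title: ..., url: ...)`; no LLM, no third-party API.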
<o0x1eef>
Anyway, I think I've spammed the channel enough :)
<havenwood>
I think it's fair to talk about Rails connecting to LLMs and different strategies. Most GUIs, web or otherwise, are using simple HTTP: SSE or, in some cases, WebSockets.
<havenwood>
But that's I guess what you're talking about. Run your service from Python and figure out how to efficiently do discovery.
<havenwood>
You can use Tailscale, radio, bluetooth, whatever for that part. Low bandwidth. I think that's why folk get by with JSON.
<havenwood>
I agree we can do better.
<havenwood>
It'd be interesting to do a Rails integration using WebSockets. I know Shopify was rewriting GRPC in pure Ruby, but dunno if it's mature enough to use yet and the old C-ext is crufty. Gives me pause.
<o0x1eef>
It takes a lot of juice to run an LLM. At least before DeepSeek. IME it's less of an issue if you're focused solely on text classification. You could self-host the AI service and not rely on any third party. I haven't tried to deploy on Amazon or whatnot, but locally it is within the realm of the possible, and I don't need the internet for my app to run.
<havenwood>
Rails could handle the standard SSE streaming interface fine but it's pretty lousy and most don't implement ID for retries or anything. Pretty simplistic. Does work.
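The SSE wire format under discussion, including the `id:` field most implementations skip, is simple enough to sketch in plain Ruby. With an id, a reconnecting client sends a Last-Event-ID header and the server can resume from there; in Rails this string would be written to the response stream via ActionController::Live.

```ruby
# Format one text/event-stream event. Each field is a "name: value" line;
# a blank line terminates the event.
def sse_event(data, id: nil, event: nil)
  lines = []
  lines << "id: #{id}" if id          # lets clients resume via Last-Event-ID
  lines << "event: #{event}" if event # optional named event type
  lines << "data: #{data}"
  lines.join("\n") + "\n\n"
end

puts sse_event('{"token":"Hello"}', id: 1, event: "message")
```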
<havenwood>
o0x1eef: Even the real DeepSeek takes hundreds of GB of VRAM to run at speed. Just the Qwen and Llama distills can run on reasonable machines. I can barely partially offload the lowest 1.58-bit quant of DeepSeek R1 on a 128GB M4 Max. Funny when a cluster of Apple Silicon is the cheapest way to run a thing. >.>
<havenwood>
Interesting idea to run Rails in the cloud on prem with GPU.
<havenwood>
I've only been exploring locally.
<o0x1eef>
I bought a gamer's PC with a decent GPU so I could test and develop locally. For focused models that do one thing and do it well, such as classification, it is more than capable. LLMs? Not so much.
KoUmas has quit [Ping timeout: 268 seconds]
Jado has joined #ruby
cappy has quit [Quit: Leaving]
<o0x1eef>
The main takeaway is that there are a lot of models on huggingface.co; a lot of them are useful and don't require the resources of an LLM, so you can solve problems using AI and keep everything in-house. No need to talk to OpenAI. No need for the cloud. I feel like I'm rambling so I will take a break.
user71 has quit [Quit: Leaving]
<fantazo>
if people do freelancing for ruby, where are projects to be found?
hwpplayer1 has quit [Remote host closed the connection]
Jado has quit [Ping timeout: 245 seconds]
konsolebox has quit [Ping timeout: 250 seconds]
Starfoxxes has quit [Remote host closed the connection]
nirvdrum7 has joined #ruby
Jado has joined #ruby
<dorian>
anybody encounter a type validation/coercion gem that competes with dry-types?
<dorian>
because i am about to launch dry-types into the sun
gemmaro has quit [Read error: Connection reset by peer]
gemmaro_ has joined #ruby
wbooze has quit [Quit: Leaving]
Inline has quit [Quit: Leaving]
xokia has quit [Quit: Leaving]
xokia has joined #ruby
graywolf has joined #ruby
Inline has joined #ruby
balrog has quit [Quit: Bye]
mange has joined #ruby
balrog has joined #ruby
ruby[bot] has quit [Remote host closed the connection]
ruby[bot] has joined #ruby
TomyLobo has quit [Read error: Connection reset by peer]