#ocaml on 2022-03-18 — irc logs at libera.irclog.whitequark.org

2021-09-27 14:53 Leonidas changed the topic of #ocaml to: Discussion about the OCaml programming language | http://www.ocaml.org | OCaml 4.13.0 released: https://ocaml.org/releases/4.13.0.html | Try OCaml in your browser: https://try.ocamlpro.com | Public channel logs at https://libera.irclog.whitequark.org/ocaml/

00:07 <d_bot_> <anmonteiro> oh siiiiiit

00:07 <d_bot_> <anmonteiro> watching 🙂

00:08 <d_bot_> <anmonteiro> oh I guess IRC doesn't see that I replied to the open telemetry link 😛

00:08 <companion_cube> no worries :)

00:08 <companion_cube> I'm wondering if it should be functorized over IO or something

00:09 <companion_cube> (which, thinking of it, might even be useful with effects)

00:18 <d_bot_> <anmonteiro> not sure how it's structured, but I've come to prefer a runtime agnostic library where possible and I/O runtimes on top of it

00:18 <d_bot_> <anmonteiro> e.g. the http/af and h2 state machines

00:19 <d_bot_> <anmonteiro> I don't think all protocols can necessarily fit that model though

00:19 <companion_cube> ah well, here it's an IO consumer, in a way

00:19 <companion_cube> I mean it's like a `logs` reporter, so I should draw inspiration from there

00:20 <companion_cube> (btw integration with `logs` might be a super useful feature)

00:26 <d_bot_> <anmonteiro> very good point

00:29 <companion_cube> many things to adjust (e.g. batching of traces, I think, would help a lot)

00:36 <d_bot_> <Anurag> For high traffic scenarios you’d most likely also want to implement some form of sampling of traces.

00:37 <companion_cube> yeah there's a flag for that, not sure what the API would look like

00:37 <companion_cube> (maybe the tracing probe would be probabilistic?)
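
A probabilistic probe of the kind floated above could look roughly like the sketch below. It is only an illustration under stated assumptions: `should_sample`, `with_span`, `rate` and `emit` are hypothetical names, not part of ocaml-opentelemetry, and the decision is made once per span with the standard library's `Random`.

```ocaml
(* Hypothetical sketch of probabilistic trace sampling.
   None of these names come from ocaml-opentelemetry. *)

(* Decide whether to record a trace; [rate] is the fraction of traces kept. *)
let should_sample ~(rate : float) : bool =
  if rate >= 1.0 then true
  else if rate <= 0.0 then false
  else Random.float 1.0 < rate

(* Run [f], and call [emit] with the span name only when the span is sampled. *)
let with_span ~rate ~emit name f =
  let sampled = should_sample ~rate in
  let result = f () in
  if sampled then emit name;
  result
```

With `~rate:0.01` roughly one span in a hundred would reach `emit`; the instrumented function `f` always runs either way.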

00:41 Tuplanolla has quit [Quit: Leaving.]

00:51 kaph has quit [Read error: Connection reset by peer]

00:51 kaph has joined #ocaml

01:18 qwr has quit [Quit: ↑]

01:25 qwr has joined #ocaml

02:23 xgqt has quit [Ping timeout: 252 seconds]

02:24 xgqt has joined #ocaml

02:41 gravicappa has joined #ocaml

03:16 rgrinberg has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

03:47 mbuf has joined #ocaml

03:49 chiastre has quit [Ping timeout: 250 seconds]

03:57 chiastre has joined #ocaml

04:15 waleee has quit [Ping timeout: 252 seconds]

04:20 <d_bot_> <darrenldl> what's the usual package for a sexp type?

04:20 <d_bot_> <darrenldl> (and well parsing/printing

04:30 <companion_cube> sexplib?

04:30 <companion_cube> (if you use containers there's one in there too)

04:33 rgrinberg has joined #ocaml

04:42 <sleepydog> Fmt.parens :p

04:42 <sleepydog> for printing, that is

04:44 zebrag has quit [Ping timeout: 252 seconds]

04:44 <d_bot_> <darrenldl> companion_cube: yeah was using ccsexp, but now moving sexp stuff into a separate package

04:45 <d_bot_> <darrenldl> and seems a bit of an overkill to install containers just to do exactly only sexp in that sub package

04:45 <d_bot_> <darrenldl> (trying to slim down deps of timedesc

04:47 <d_bot_> <darrenldl> hm everything uses sexplib basically huh, seems like perfect choice, cheers

04:48 <companion_cube> @darrenldl funny, ccsexp is there to not have to pull something just for sexprs :D

04:48 <companion_cube> (… if you already have containers)

05:05 <d_bot_> <darrenldl> companion_cube: can ccsexp be a standalone package : v

05:55 gravicappa has quit [Ping timeout: 250 seconds]

06:13 rgrinberg has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

06:41 tizoc has quit [*.net *.split]

06:41 Absalom has quit [*.net *.split]

06:41 cross has quit [*.net *.split]

06:44 tizoc has joined #ocaml

06:44 Absalom has joined #ocaml

06:44 cross has joined #ocaml

07:22 mro has joined #ocaml

07:22 kurfen has quit [Ping timeout: 240 seconds]

07:47 mro has quit [Remote host closed the connection]

07:47 spip has joined #ocaml

07:47 bobo has quit [Ping timeout: 256 seconds]

07:48 <sadiq> companion_cube, yes. So you could see GC and runtime trace events.

07:48 mro has joined #ocaml

08:30 kaph has quit [Read error: Connection reset by peer]

08:31 mro has quit [Remote host closed the connection]

08:32 mro has joined #ocaml

08:39 kaph has joined #ocaml

08:40 Absalom has quit [Quit: the lounge - https://webirc.envs.net]

08:41 Absalom has joined #ocaml

08:41 bartholin has joined #ocaml

08:45 mro has quit [Remote host closed the connection]

08:46 mro has joined #ocaml

08:48 Everything has quit [Quit: leaving]

08:56 olle has joined #ocaml

09:00 OCamlPro[m] has quit [Quit: You have been kicked for being idle]

09:13 bartholin has quit [Ping timeout: 240 seconds]

09:13 bartholin has joined #ocaml

09:41 mro has quit [Remote host closed the connection]

10:03 <d_bot_> <wokalski> companion_cube: regarding lwt+tracing; what were your thoughts about the integration? We talked about this exact topic a couple of weeks ago with my team and we thought about creating a wrapper (`type 'a t = (ctx * 'a) Lwt.t`) that will carry the context around behind the scenes but maybe there's a smarter way.

10:16 gentauro has quit [Read error: Connection reset by peer]

10:16 gentauro has joined #ocaml

10:28 mro has joined #ocaml

10:33 mro has quit [Ping timeout: 252 seconds]

10:49 gravicappa has joined #ocaml

10:52 bartholin has quit [Ping timeout: 268 seconds]

10:54 bartholin has joined #ocaml

11:07 waleee has joined #ocaml

11:33 kurfen has joined #ocaml

13:14 <companion_cube> @darrenldl tough ask :D

13:15 <d_bot_> <darrenldl> (:

13:16 <companion_cube> @wokalski: @antron has some cool tricks in Dream, maybe Lwt.key might be a good fit?

13:21 <d_bot_> <Anurag> @companion_cube Lwt.key is something I looked at for similar needs in the APM library I worked on. But I see it marked as deprecated <https://github.com/ocsigen/lwt/blob/master/src/core/lwt.mli#L1691-L1700> so I didn't push forward with that approach. I instead ended up using `with_` style functions and manually forwarding the trace context everywhere I needed to (including forwarding it to logs for adding the trace context as a

13:22 <companion_cube> yeah I guess

13:23 waleee has quit [Quit: WeeChat 3.4]

13:34 szkl has quit [Quit: Connection closed for inactivity]

13:39 bartholin has quit [Ping timeout: 240 seconds]

13:42 Haudegen has joined #ocaml

13:52 bartholin has joined #ocaml

13:53 <companion_cube> you need some manual work anyway, if you're forwarding trace_id/span_id to remote hosts

14:08 <d_bot_> <Anurag> That is true. So far my approach has been to create http clients from a given trace context. Then all calls made using that client will automatically add a trace context http header when making outgoing calls.
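
The "client created from a trace context" idea can be sketched as follows. Everything here is an assumption made for illustration: the types `trace_ctx` and `client` and the functions are hypothetical, and the header layout assumes the W3C Trace Context `traceparent` format, which the chat does not name.

```ocaml
(* Hypothetical types; the APM library's real API is not shown in the chat. *)
type trace_ctx = { trace_id : string; span_id : string }

type client = { default_headers : (string * string) list }

(* Assumes the W3C Trace Context "traceparent" layout:
   version-traceid-spanid-flags. *)
let traceparent ctx = Printf.sprintf "00-%s-%s-01" ctx.trace_id ctx.span_id

(* Build a client whose default headers carry the trace context. *)
let client_of_ctx ctx =
  { default_headers = [ ("traceparent", traceparent ctx) ] }

(* Every request built through the client gets the trace header first. *)
let request_headers client extra = client.default_headers @ extra
```

The caller never passes the context per request; it is fixed when the client is created.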

14:08 <companion_cube> so I'm looking at Logs, and the reporter interface seems very promising

14:09 <companion_cube> one interface can serve both sync and async needs

14:25 d_bot_ has quit [Remote host closed the connection]

14:26 d_bot has joined #ocaml

14:36 olle_ has joined #ocaml

14:38 mro has joined #ocaml

14:56 mro has quit [Remote host closed the connection]

15:09 mro has joined #ocaml

15:10 mro has quit [Remote host closed the connection]

15:15 dextaa_ has joined #ocaml

15:15 <d_bot> <mbacarella> I kinda wish libraries that provided independent I/O endpoints just implemented a synchronous interface so that you can wrap them in `In_thread.run (fun () -> ...)` if you want an Async version

15:15 <d_bot> <mbacarella> dealing with functorized promise modes in every single library in the world is kind of a drag

15:16 <companion_cube> the Logs thing should work, actually, in this case

15:16 <companion_cube> with a small Lwt wrapper

15:16 <companion_cube> I just need an async reporter now :D

15:17 Serpent7776 has quit [Read error: Connection reset by peer]

15:19 <d_bot> <mbacarella> `Lwt_preemptive.detach`

15:19 <d_bot> <mbacarella> Lwt wrapper: done

15:19 <companion_cube> nah, not even

15:19 <companion_cube> https://github.com/AestheticIntegration/ocaml-opentelemetry/blob/master/src/lwt/opentelemetry_lwt.ml#L13-L20 :)

15:20 <companion_cube> again, imitating Logs.reporter

15:20 <companion_cube> it's quite a nice API really

15:20 <companion_cube> a callback to return a value (here, a promise); and a callback to signal the job is done (here, to wakeup the promise)
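
The two-callback shape described above can be imitated with the standard library alone. This is a simplified sketch, not the actual `Logs.reporter` type or the linked opentelemetry code: `reporter`, `log_sync`, `log_async` and the toy `promise` record are all hypothetical stand-ins.

```ocaml
(* Simplified, hypothetical imitation of the Logs.reporter shape:
   [over] signals that the job is done, [k] produces the returned value. *)
type reporter = {
  report : 'b. over:(unit -> unit) -> (unit -> 'b) -> string -> 'b;
}

(* Messages accepted so far. *)
let buffer : string Queue.t = Queue.create ()

(* A reporter that stores the message, signals completion with [over],
   then returns whatever [k] returns. *)
let buffering_reporter = {
  report = (fun ~over k msg ->
    Queue.add msg buffer;
    over ();
    k ());
}

(* Synchronous caller: nothing to wait for, so [over] is a no-op. *)
let log_sync r msg = r.report ~over:(fun () -> ()) (fun () -> ()) msg

(* "Async" caller, with a toy promise standing in for Lwt:
   [k] returns the promise and [over] resolves it. *)
type promise = { mutable resolved : bool }

let log_async r msg : promise =
  let p = { resolved = false } in
  r.report ~over:(fun () -> p.resolved <- true) (fun () -> p) msg
```

The same reporter value serves both callers; only the pair of callbacks changes, which is what lets one interface cover sync and async use.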

15:20 <d_bot> <orbitz> I think that would be a subpar experience. My preference is for people to offer an API that takes bytes and gives back decoded frames, and then you can implement a sync or async API on top of that.
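
An API that "takes bytes and gives back decoded frames" might look like the sketch below. The framing (newline-terminated strings) and the names `decoder`, `create` and `feed` are assumptions chosen to keep the example small; real protocols such as the http/af and h2 state machines mentioned earlier are far more involved.

```ocaml
(* Hypothetical IO-free decoder: frames are newline-terminated strings.
   The decoder never touches a socket; callers feed it the bytes they read. *)
type decoder = { pending : Buffer.t }

let create () = { pending = Buffer.create 64 }

(* Feed a chunk of input and return the frames it completes, in order.
   Bytes after the last newline stay buffered for the next call. *)
let feed (d : decoder) (chunk : string) : string list =
  let frames = ref [] in
  String.iter
    (fun c ->
      if c = '\n' then begin
        frames := Buffer.contents d.pending :: !frames;
        Buffer.clear d.pending
      end else Buffer.add_char d.pending c)
    chunk;
  List.rev !frames
```

A blocking wrapper would call `feed` after each `read`, and an Lwt or Async wrapper would call it after each resolved read promise; the decoder itself stays the same.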

15:21 <companion_cube> depends for what

15:22 <companion_cube> for a protocol, I agree, I guess? not sure it's easier than the functor though

15:22 <d_bot> <orbitz> I haven't seen it done very successfully as a functor. Cohttp, for example, is a pain to implement, IMO

15:22 Serpent7776 has joined #ocaml

15:23 <companion_cube> well if you want to see an incredibly large scale and robust system (lolol), see: irc-client

15:23 <companion_cube> (warning: a smidge sarcastic here :D)

15:24 <d_bot> <orbitz> It was hard to tell over the lol's 🙂

15:24 <companion_cube> you never know!

15:25 <d_bot> <mbacarella> the reason dealing with functorized promise monads is kind of a drag is because probably the author only implements either lwt or async

15:26 <d_bot> <mbacarella> but also if you're trying to release a library you feel a little guilty because you've implemented only async or lwt bindings

15:27 <d_bot> <mbacarella> that is, there's pressure to do the functorized promise monads. the concept alone creates some angst

15:28 <d_bot> <orbitz> To support companion_cube and not support my claim: dns-client does something not terrible in that its API surface is fairly small so you do implement a functor with monad + their API and then it takes care of calling the right things. I think it works ok there, but it's delicate work and, for example, I had to request (and was thankfully obliged) to change the API in a way that made handling cancellation correctly possible

15:31 unyu has quit [Quit: brb]

15:36 <companion_cube> @mbacarella it does mean you can relatively easily add the other one, whereas if you don't functorize (or do the stateless IO-less version) you have to rewrite everything

15:37 <d_bot> <mbacarella> sorry, rephrase?

15:38 <companion_cube> you might functorize and do only lwt

15:38 <companion_cube> but adding async is then easier than if you had no functor to start with
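
The functorized approach under discussion can be sketched like this. The `IO` signature, `Make`, `Blocking_io` and `send_all` are hypothetical names for illustration; a real library would expose a richer signature, and an Lwt or Async instance would supply its own promise type in place of the blocking one.

```ocaml
(* Minimal IO signature the library is functorized over (hypothetical). *)
module type IO = sig
  type 'a t
  val return : 'a -> 'a t
  val bind : 'a t -> ('a -> 'b t) -> 'b t
  val write : string -> unit t
end

(* Library logic written once against [IO]. *)
module Make (Io : IO) = struct
  (* Write every message in order and return how many were sent. *)
  let send_all (msgs : string list) : int Io.t =
    List.fold_left
      (fun acc msg ->
        Io.bind acc (fun n ->
            Io.bind (Io.write msg) (fun () -> Io.return (n + 1))))
      (Io.return 0) msgs
end

(* Blocking instance: ['a t] is just ['a]; writes go to an in-memory buffer. *)
module Blocking_io = struct
  type 'a t = 'a
  let return x = x
  let bind x f = f x
  let sent = Buffer.create 16
  let write s = Buffer.add_string sent s
end

module Blocking_client = Make (Blocking_io)
```

Adding a second runtime then means writing one more small module matching `IO`, rather than rewriting `send_all`.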

15:38 <d_bot> <mbacarella> what i'm saying is there's no shame in your wrapper being `In_thread.run (fun () -> let h = Sync_io.create () in Sync_io.send_request h request; let reply = Sync_io.read_response h in Sync_io.close h; reply)`

15:38 <companion_cube> for a stub, sure

15:39 <companion_cube> for actual use, meh :D

15:39 <d_bot> <mbacarella> i mean, the functorized monad version will be doing something close to that behind the scenes anyway

15:40 <d_bot> <mbacarella> i will reiterate i meant for independent I/O endpoints. if you have to share handles between many calls yeah that has obvious issues

15:42 <companion_cube> hmm, In_thread means using a thread pool? might actually scale decently, I guess

15:45 <d_bot> <mbacarella> IIRC all I/O in Async is dispatched to a thread-pool

15:46 <d_bot> <orbitz> No

15:46 <d_bot> <orbitz> File I/O probably is

15:47 <companion_cube> on linux, probably

15:47 olle has quit [Quit: Lost terminal]

15:47 olle_ has quit [Remote host closed the connection]

15:49 rgrinberg has joined #ocaml

15:50 rgrinberg has quit [Client Quit]

15:51 <d_bot> <mbacarella> hmm, indeed, async_unix src/unix_syscalls.ml is heavy on the In_thread. stuff except for socket stuff

15:51 mro has joined #ocaml

15:56 <d_bot> <mbacarella> I wonder why the break-out for network stuff. Guessing the payloads are tinier and more latency sensitive.

15:56 mro has quit [Remote host closed the connection]

15:56 <d_bot> <orbitz> @pilothole No, it's because most OS's have functioning non-blocking network APIs, but not for file systems

15:57 <d_bot> <mbacarella> ah, that sounds more plausible

15:58 <companion_cube> io_uring is supposed to change this for linux

15:58 <companion_cube> with some exciting new vulnerabilities, too

15:58 <d_bot> <orbitz> It's not progress without a security hole or two

15:59 <d_bot> <mbacarella> you would still probably rather use the threaded interface instead of managing a non-blocking state machine if you could get away with it? I imagine it doesn't perform as awesome

16:00 <d_bot> <orbitz> Threads do not perform awesome

16:02 <d_bot> <mbacarella> actually, why not? you have context switches whether you use threads or non-blocking I/O

16:02 <d_bot> <mbacarella> in fact, i could imagine one argument where it's better to have the kernel operate the state machine for you and let you have concurrency through threading because the kernel knows more about what's going on and is heavily optimized

16:03 <d_bot> <orbitz> @pilothole The information around why it is more efficient to use the non-blocking APIs is pretty detailed on the internet. You can start with the C10k problem, which is quite old

16:03 <companion_cube> @orbitz depends for what, i guess

16:03 <companion_cube> doesn't scale as well, for sure

16:03 <companion_cube> (although c10k is doable)

16:04 <d_bot> <orbitz> Sure, c10k is not large these days, but the principles still apply

16:04 <d_bot> <orbitz> My point is: this isn't some wacky decision made by OCaml developers, there is a pretty robust discussion that's gone on over the decades

16:04 <d_bot> <mbacarella> i didn't say it was a wacky decision, just trying to remember from first principles

16:04 <d_bot> <mbacarella> for the sake of argument

16:07 <sim642> Uhoh, this is confusing: Error: This expression should not be a function, the expected type is Ppx_deriving_ord_helper.t -> Ppx_deriving_runtime.int

16:08 waleee has joined #ocaml

16:11 <d_bot> <Anurag> @pilothole async uses `In_thread` when performing file io since that doesn't support nonblocking operations. For socket io (If Fd.t supports nonblocking mode) it doesn't go through In_thread (unless you are performing a writev call with a large number of iovecs)

16:12 <d_bot> <Anurag> Ah, nvm, you just said that a little while ago (I should have scrolled further 😄 )

16:14 <companion_cube> sim642: weird indeed

16:14 <d_bot> <Anurag> async (and lwt) does make working with nonblocking IO pretty straightforward so as a user of Lwt_io/Reader/Writer (or any other wrapper over some lwt/async primitives), one doesn't really have to think about managing IO threads, or orchestrating non blocking state machines.

16:16 <d_bot> <mbacarella> anyway, haven't threads traditionally sucked for c10k level concurrency because the OS doesn't make smart decisions about what to schedule next?

16:16 <d_bot> <mbacarella> i recall an old observation that if you had N processes waiting on accept the kernel would wake all of them only to put N-1 of them back to sleep, because only one can get the next client

16:16 <d_bot> <mbacarella> (the kernel here being linux)

16:17 <d_bot> <mbacarella> or, stated another way, you could blow my argument up where when i say "it's better to have the kernel operate the state machine for you and let you have concurrency through threading because the kernel knows more about what's going on and is heavily optimized" i'm being hopelessly naive

16:18 <d_bot> <orbitz> The general issue is threads are heavy and if you have lots of connections, you use up a lot of RAM + context switches

16:21 <d_bot> <mbacarella> right. but assuming you had a perfectly executed concept, why should 1000 threads waiting on 1000 different descriptors be heavier than one thread waiting to find out about an event in 1 of 1000 descriptors? why does one model require more context switches than the other?

16:22 <d_bot> <mbacarella> (true 1000 different threads means 1000 different stacks)

16:23 mro has joined #ocaml

16:24 <d_bot> <orbitz> It depends on the specifics. But let's say you wrap "read" in an In_thread.run, you need to (possibly) spawn a thread, wait for its response, context switch over to it, read (syscall), notify that the result is ready, context switch back, handle the notification is ready, and continue, and maybe kill a thread. The first and last are alleviated by a thread pool to some degree

16:24 <d_bot> <orbitz> if you batch more operations, then context switching becomes less dominating

16:25 <d_bot> <orbitz> But per thread you have 4 context switches in that scenario: into and out of the thread, and the syscall

16:25 <d_bot> <orbitz> at least 4

16:26 <companion_cube> otoh if you read gigantic files in a few threads, I don't think lwt would help

16:26 <companion_cube> performance isn't one-dimensional

16:26 <d_bot> <orbitz> Excuse me, I'm referring specifically to network here. I should have been more explicit

16:26 <d_bot> <mbacarella> ah, fair, in the ocaml model dispatching things to thread pools is 4x as many context switches as needed

16:26 <d_bot> <orbitz> @pilothole what I just described has nothing to do with ocaml

16:27 <d_bot> <mbacarella> sorry, i meant the ocaml models we have been talking about

16:27 <d_bot> <orbitz> What I described is inherent to any unixy thread model

16:28 <d_bot> <orbitz> Go and Erlang handle pushing I/O to an event loop for you for example. High performance networking APIs like Java's Jetty use an event loop as well

16:29 <d_bot> <orbitz> Netty, maybe I meant

16:29 <d_bot> <rudy> Does anyone have a nice resource on what OCaml is good for? I'm looking for something in-depth, and perhaps with comparisons to other famous languages like Java or C.

16:29 <d_bot> <orbitz> @rudy It's a general purpose programming language, so it's good for all the things!

16:29 <Corbin> rudy: Yes. Additionally, it can.

16:29 waleee has quit [Ping timeout: 240 seconds]

16:30 <d_bot> <mbacarella> kind of? the reason for the 4x context switches in this approach is because you're throwing away the possibility of sequencing by dispatching everything to an I/O threadpool

16:30 <d_bot> <orbitz> @pilothole I don't understand what you mean. The scenario I described was pretty specific.

16:31 <d_bot> <orbitz> As I said, context switches become less dominating the more amount of work you do in the thread, but if you are running a specific syscall in a thread, you have a bunch of context switches

16:31 waleee has joined #ocaml

16:34 <d_bot> <Et7f3 (@me on reply)> https://www.ssi.gouv.fr/uploads/IMG/pdf/Mind_Your_Languages_-_version_longue.pdf

16:35 <d_bot> <Et7f3 (@me on reply)> it is not one vs one but it shows arguments that help you choose

16:37 <d_bot> <Et7f3 (@me on reply)> The conclusion I took from this: when a language is static and includes many warnings it becomes safer by design.

16:37 <companion_cube> @orbitz yeah yeah, for networking it's quite clear

16:38 <d_bot> <mbacarella> what i'm saying is you could probably eliminate a context switch if you don't have to enter and exit a threadpool if you do two system calls in a row if they depend on each other (e.g. a thread dedicated to reading a request and then sending a reply), but i now agree with you/remember threads suck because you wrap each system call context switch in a thread entry/exit context switch

16:39 <d_bot> <orbitz> @pilothole That would be "doing more stuff in the thread" as I mentioned which will reduce your context switches.

16:39 <d_bot> <mbacarella> right

16:40 <d_bot> <orbitz> But even still, depending on your scale, you're using more resources than are necessary

16:40 <d_bot> <Et7f3 (@me on reply)> log4j comes from a hyper-dynamic language where deserialization can run arbitrary code. It is not the first time: https://opensource.googleblog.com/2017/03/operation-rosehub.html @rudy The time you gain by developing quickly is time you will lose later, and worse. JS was coded in 7 days and now many web devs spend time creating tools to fix it.

16:41 <d_bot> <mbacarella> yes, i agree. i will withdraw the contention. threads sux non-blocking I/O roolz

16:42 <d_bot> <orbitz> @pilothole Consider you have 100 connections and they all have something to read at the same time: you can do 1 kqueue call + 100 reads (202 context switches) or you can do 100 * 4, at least, context switches
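
The arithmetic in that example can be written out as a small model. The per-operation costs are the ones stated in the chat (two context switches per syscall, at least four per threaded read), not measurements, and the function names are hypothetical.

```ocaml
(* Back-of-the-envelope model of the example above; the costs are the ones
   stated in the chat, not measurements. *)

(* Each syscall is counted as two context switches: into and out of the kernel. *)
let switches_per_syscall = 2

(* Event loop: one readiness call (e.g. kqueue) plus one read per connection. *)
let event_loop_switches ~connections =
  (1 + connections) * switches_per_syscall

(* One thread per read: at least four switches each (into and out of the
   thread, plus the syscall). *)
let threaded_switches ~connections = connections * 4
```

For 100 ready connections this gives 202 switches for the event loop against at least 400 for the threaded version, matching the figures quoted.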

16:42 <d_bot> <mbacarella> yes right clearly worse

16:43 mro has quit [Remote host closed the connection]

16:49 mbuf has quit [Quit: Leaving]

16:54 <d_bot> <Et7f3 (@me on reply)> @Bluddy description of #débutants is not clickable, you can write <# 913078886345085008> without the space. And why do we have notifications in #rules, did you modify something?

17:00 zebrag has joined #ocaml

17:04 Haudegen has quit [Quit: Bin weg.]

17:06 <d_bot> <mbacarella> neat article

17:22 <d_bot> <Et7f3 (@me on reply)> reposted in #share

17:53 rgrinberg has joined #ocaml

17:54 oriba has joined #ocaml

17:55 kakadu has quit [Remote host closed the connection]

18:10 Techcable has quit [Remote host closed the connection]

18:10 Techcable has joined #ocaml

18:20 chrisz has quit [Ping timeout: 252 seconds]

18:21 chrisz has joined #ocaml

18:22 Haudegen has joined #ocaml

18:38 dextaa_ has quit [Remote host closed the connection]

18:40 bartholin has quit [Quit: Leaving]

18:48 <d_bot> <mbacarella> so you have a `type foo = Foo | Bar | Baz` exposed in a library. you want to get free sexp conversions for it without copy/pasting it into your library. sadly `type foo = Library.foo [@@deriving sexp]` doesn't work

18:50 <d_bot> <EduardoRFS> ppx_import

18:58 <d_bot> <mbacarella> that's awesome

19:02 olle has joined #ocaml

19:32 mro has joined #ocaml

19:36 mro has quit [Ping timeout: 240 seconds]

19:42 bobo has joined #ocaml

19:42 spip has quit [Ping timeout: 252 seconds]

19:51 t-j-r has quit [Quit: quitting]

19:54 infinity0 has quit [Remote host closed the connection]

20:04 infinity0 has joined #ocaml

20:11 mro has joined #ocaml

20:11 rgrinberg has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

20:17 gravicappa has quit [Ping timeout: 240 seconds]

20:34 mro has quit [Ping timeout: 256 seconds]

20:35 kaph has quit [Ping timeout: 240 seconds]

20:44 kaph has joined #ocaml

21:22 unyu has joined #ocaml

21:42 wyrd has quit [Ping timeout: 240 seconds]

21:44 wyrd has joined #ocaml

21:47 olle has quit [Remote host closed the connection]

22:13 oriba has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

22:17 Serpent7776 has quit [Quit: leaving]

22:29 Tuplanolla has joined #ocaml