#ocaml on 2025-03-10 — irc logs at libera.irclog.whitequark.org

2024-05-29 14:49 companion_cube changed the topic of #ocaml to: Discussion about the OCaml programming language | http://www.ocaml.org | OCaml 5.2.0 released: https://ocaml.org/releases/5.2.0 | Try OCaml in your browser: https://try.ocamlpro.com | Public channel logs at https://libera.irclog.whitequark.org/ocaml/

00:00 alfiee has quit [Ping timeout: 248 seconds]

00:03 rgrinberg has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

00:08 deadmarshal_ has joined #ocaml

00:08 <companion_cube> You don't need lazy to be referentially transparent

00:08 <companion_cube> And it gets in the way of IOs a bit

00:14 ygrek has joined #ocaml

00:20 malte has joined #ocaml

00:25 malte has quit [Remote host closed the connection]

00:30 germ- has quit [Read error: Connection reset by peer]

00:32 germ has joined #ocaml

00:41 alfiee has joined #ocaml

00:46 alfiee has quit [Ping timeout: 265 seconds]

00:57 Haudegen has quit [Quit: Bin weg.]

01:04 Mister_Magister has quit [Quit: bye]

01:06 malte has joined #ocaml

01:07 Mister_Magister has joined #ocaml

01:14 <discocaml> <yawaramin> you kinda do need it. eg let `downloadFile :: IO ()` and `retry :: IO () -> IO ()`. then `retry downloadFile` works as expected if IO is lazy but not if it's eager

01:14 Everything has joined #ocaml

01:14 malte has quit [Remote host closed the connection]

01:17 malte has joined #ocaml

01:24 malte has quit [Ping timeout: 252 seconds]

01:26 alfiee has joined #ocaml

01:31 alfiee has quit [Ping timeout: 248 seconds]

01:35 <companion_cube> Just add `() ->` in front, whatever

01:36 <companion_cube> I mean it's actually even fine, the issue is when >>= is lazy afaik

01:36 <companion_cube> Here it's a question of hot vs cold

01:39 <discocaml> <yawaramin> `() ->` is laziness

01:40 <companion_cube> Not in the Haskell sense but anyway

01:41 malte has joined #ocaml

01:42 <discocaml> <yawaramin> yeah in the Haskell sense too. Haskell `IO a` is defined as basically `RealWorld -> (a, RealWorld)` ie a function that can be run ie laziness

01:42 <companion_cube> https://stackoverflow.com/questions/5892653/whats-so-bad-about-lazy-i-o

01:43 <companion_cube> No but Haskell adds a layer of lazyness

01:43 <companion_cube> It's not just the suspension

01:43 <discocaml> <yawaramin> https://fs2.io/#/

01:48 <companion_cube> but is it lazy like Haskell's IO is lazy?

01:51 <companion_cube> I've sometimes been wondering what it'd be like to have a sort of IO monad instead of Lwt

01:51 <companion_cube> but I'm not sure it's a good idea for OCaml

01:54 <dh`> in connection with $work I have been wrangling a DSL that someone built with monads because they were haskell people

01:54 <dh`> but they did not feel like implementing the whole nine yards, so you can't define your own monads, soe you can't actually do anything

01:55 <dh`> so part of what I'm trying to figure out is whether to rip the monads out in favor of a tasteful implementation of ref cells, or fix it to be useful

02:02 gzar has quit [Quit: WeeChat 4.5.2]

02:05 Tuplanolla has quit [Quit: Leaving.]

02:07 <discocaml> <diligentclerk> @companion cube - "You don't need lazy to be referentially transparent" Can you explain this? Say I have a function f : 'a -> IO 'b and I have a function g that calls f. I want to make it clear from the type signature of g that g has side effects. Laziness provides an easy way (intuitively) to track this, because you have to return the value to the main module in order to actually execute that effect, so it shows up in the type signature

02:10 <dh`> state monads encoded as functions as described above are inherently lazy that way anyway

02:10 <dh`> nothing runs until you feed them the starting state at the top level

02:11 <dh`> they have state -> instead of () ->, but it ends up serving the same purpose

02:12 alfiee has joined #ocaml

02:13 <discocaml> <diligentclerk> Okay. So if you have a strict IO monad it has a completely different implementation. How does the type system track effects in these languages?

02:13 <discocaml> <diligentclerk> Could I implement a strict IO monad in OCaml the same way we are talking about implementing a lazy IO monad above? or would this be at the level of the language implementation

02:14 <companion_cube> look, Lean and Coq are pure but not lazy

02:14 <companion_cube> lazyness just allows you to write some stuff like `>>` and `&&` more easily, at the cost of bad perf by default and space leaks

02:15 <companion_cube> `f : 'a -> IO 'b` isn't inherently lazy, what matters is that all side effects are thus encapsulated

02:15 <dh`> laziness is meaningless in coq because it's _actually_ pure so it doesn't matter when anything's eval'd

02:15 <companion_cube> no I mean even if you extract Coq to OCaml and add some IO framework

02:16 <companion_cube> but for lean it's maybe more apparent since it has some native IOs?

02:16 <dh`> fair enough

02:16 alfiee has quit [Ping timeout: 248 seconds]

02:18 <discocaml> <._null._> Rocq is call-by-name, so closer to call-by-need than to call-by-value. But also, Rocq has no effects, so the difference is not noticeable (except through complexity I guess)

02:20 <discocaml> <._null._> And at the same time, extraction to OCaml means this code is run in call-by-value, because for closed terms it doesn't matter

02:21 <dh`> is it any of these things specifically? there are several different proof tactics for evaluating things

02:21 <dh`> (including "cbn" and "cbv")

02:22 <discocaml> <._null._> The kernel needs to run in cbn for it to work properly. These tactics are purely for efficiency purposes

02:22 <dh`> hmm

02:23 <discocaml> <._null._> Not entirely sure about that "needs to", but I can at least tell you that it definitely does

02:24 <dh`> hmm

02:24 <dh`> idk

02:25 <dh`> I wouldn't think that it would matter (for the reasons already cited) but there are many subtleties

02:25 <discocaml> <._null._> The question would become "what is a value in an open term ?" Is `x 0` a value ? Because it's in normal form

02:25 euphores has quit [Ping timeout: 252 seconds]

02:27 <dh`> anyway you can make an IO monad where things happen at the time you call >>=, you just need to make certain operations () -> IO t instead of IO t so they don't happen prematurely

02:27 <discocaml> <._null._> If it's not a value, then cbv can't work in the kernel; if it is, then I'm curious what definition of "value" you would take

02:28 <companion_cube> like dh` says

02:28 <dh`> ISTM that 'x 0' is a value if and only if it's reducible (which is, granted, a circular definition)

02:28 <companion_cube> and even then I don't think you actually need `() -> …` that much

02:28 <dh`> but therefore it not a value if x unfolds

02:28 <companion_cube> >>= is enough

02:29 <discocaml> <._null._> (x would be a variable, so not reducible)

02:29 <dh`> no, it's fairly rare to have actions of the form IO t that don't take arguments of some kind

02:29 <companion_cube> `type 'a io = Ret of 'a | Bind : ('a io * ('a -> 'b io)) -> 'b io | Read : string io`

02:29 <companion_cube> this just needs a sort of toplevel recursive interpreter and voilà

02:29 <companion_cube> (s/Read/Readline or whatever)

02:30 <dh`> ah, and you have to substitute x to make progress? that makes sense, but I'm also not sure it constitutes CBV

02:30 <dh`> yeah but the general read takes a filehandle :-)

02:30 <companion_cube> yeah yeah you know what I mean

02:30 <dh`> right, there certainly are cases

02:30 <dh`> just relatively few

02:30 <companion_cube> `Read : fd * bytes * int * int -> int io`

02:31 <companion_cube> exact same for Write

02:32 <dh`> there's getpid: pid_t io

02:33 euphores has joined #ocaml

02:34 <companion_cube> for a real implementation we'd need an open type

02:34 <companion_cube> this is just to illustrate

02:34 <companion_cube> but the thing is, a `foo io` wouldn't do a thing until it comes into the interpreter

02:34 <companion_cube> (again unlike lwt, where promises are hot from the moment they're created)

02:34 <dh`> depends if you tried to implement it with a state

02:36 <dh`> you can still have the 'a IO = mkIO of world -> (world * 'a) formulation and be eager, but then a loose `foo io` has a state

02:36 <dh`> s/has/has to have/

02:36 ygrek has quit [Remote host closed the connection]

02:36 <dh`> but there's no reason that's necessary either, you can make the world state magic

02:36 <companion_cube> this is just a metaphor, I can't pass the world into a OCaml thread :p

02:36 <companion_cube> I'd absolutely not do that, I'd do a GADT + a `main : unit io -> unit` that runs the interpreter, I think

02:37 <dh`> anyway if you have threads you really kinda want something that reflects the thread state, which then isn't actually a monad any more

02:38 <companion_cube> why not?

02:38 <companion_cube> as long as it stays in IO

02:38 <companion_cube> `gettid : int io`, blabla

02:39 <companion_cube> `spawn : 'a io -> 'a thread io`, `join : 'a thread -> 'a io`, etc ?

02:40 <dh`> well

02:40 <dh`> haskell threads are like that

02:40 <dh`> but you give up any ability to use the not-monad structure to reason about things

02:41 <dh`> whereas operations like passing a value from one thread to another are actions of the form IO a * IO a -> IO a * IO a

02:41 <dh`> (sorta)

02:41 <dh`> and obviously this does not work with either the monad type or the monad notation

02:42 <dh`> but you could imagine a thing where instead of the single linear state of a monad gets you, you have one state chain per thread, and points where they intersect are points where the threads need to synchronize with each other

02:43 <companion_cube> I mean, this gets you a clear way to see IOs in types, not much else I think

02:43 <dh`> this gives you a directed graph and if it's cyclic you have a problem, so this gives you a pretty strong static deadlock checker if you can figure out how to write it down

02:44 <companion_cube> hmmmmmm

02:44 <companion_cube> if you can detect deadlocks statically I think it means it's too restrictive

02:44 <dh`> when I first saw the "from monads to arrows" paper I got all excited because I thought this was where they were going, but it's something else much less interesting

02:44 <dh`> maybe, maybe not

02:45 <dh`> acyclicity of that graph is probably equivalent to assigning every lock a static priority, which is necessary but not sufficient

02:45 <dh`> well, that's imprecise, but there's a sense in which it's necessary and not sufficient

02:45 <companion_cube> means you have a statically known number/topology of locks?

02:46 <dh`> yeah, but you might have multiple instances of some that you can only disambiguate at runtime

02:46 <dh`> vnode locks in a kernel being the classic example

02:47 <dh`> anyway this is an idea I mess around with occasionally but I haven't thought about it much for a while

02:50 <dh`> one of the complications is that if you want to write your thread code in anything like a normal way, that 'a IO * 'b IO -> 'b IO * 'a IO exchange operation is not one operation, it's two half-operations that go in different places in the code

02:51 <dh`> how do you write down half operations and link them to each other in a type system? seems both like it should be possible and that it isn't going to be trivial

02:52 <companion_cube> maybe structured concurrency?

02:52 <companion_cube> but I don't think it's very realistic to hope for ML-level types to solve threading issues

02:52 <companion_cube> even Rust solves data races, not deadlocks

02:52 <dh`> no

02:52 <dh`> data races aren't even a real problem

02:53 <dh`> but that's a whole other rant

02:54 <companion_cube> wat :D

02:55 <dh`> what cpu has unpredictable/undefined behavior if two cores store to the same address at once? none, one of the cores goes first, and it has been thus since the very early days

02:56 <dh`> that is, data races on machine words are not a thing

02:56 <companion_cube> isn't that unpredictable though?

02:57 <companion_cube> which one goes first

02:57 <companion_cube> and of course, what if you write multiple words

02:57 ski has quit [Remote host closed the connection]

02:57 <companion_cube> you might get invalid stuff

02:57 <dh`> not in the cpu sense of UNPREDICTABLE (which is like undefined behavior)

02:57 <dh`> if you do a _sequence_ of stores from one core and a _sequence_ of stores from another store

02:57 <dh`> er, another core

02:58 <dh`> then you might get a different ordering on each word and you get trahs out

02:58 <dh`> trash

02:58 <dh`> and that's a problem

02:58 <companion_cube> and that's a data race

02:58 alfiee has joined #ocaml

02:58 <dh`> it's a _race_

02:58 <companion_cube> in terms of the language, you get impossible values

02:58 <companion_cube> so that's just… really bad?

02:59 <dh`> two simultaneous sequences of stores are only correct if they're mutually consistent

02:59 <dh`> there are various ways you might define consistency but it's stronger than data-race freedom

02:59 Everything has quit [Ping timeout: 272 seconds]

03:00 <dh`> the upshot of which is that data-race freedom isn't useful for talking about correctness

03:01 <dh`> suppose you have an incorrect concurrent program

03:01 <companion_cube> afaict it's useful to have reasonable guarantees to understand what's going on

03:01 Everything has joined #ocaml

03:01 <companion_cube> it might not be the only way but it works really well for rust

03:01 <dh`> and you try to fix it by adding a global lock, and taking and releasing that global lock around every read or write

03:01 <dh`> poof! no more data races

03:01 <dh`> but you haven't altered the semantics of the program and it's still wrong

03:01 <companion_cube> sure

03:01 <companion_cube> but at least your program has a semantic

03:01 <companion_cube> otherwise it just wouldn't

03:02 <dh`> does it? not really

03:02 <companion_cube> sure

03:02 <dh`> the behavior depends on the interleaving of the executions

03:02 <dh`> (that's more or less the definition of race condition)

03:02 <companion_cube> yes but it remains within the boundaries of the language

03:03 <dh`> oh, yes, there's a sense in which it's safe

03:03 <companion_cube> all the possible interleavings are within these bounds

03:03 alfiee has quit [Ping timeout: 276 seconds]

03:03 <companion_cube> that's why we talk of data races and not race conditions

03:03 <companion_cube> it's just like how memory safety is not full correctness

03:03 <companion_cube> but is still a useful concept

03:03 <dh`> we talk about data races because data races are easy to identify and talk about

03:03 <companion_cube> sure

03:03 <companion_cube> and it's very valuable to prevent them

03:04 <dh`> it is very valuable to have correct concurrent programs

03:04 <dh`> I have seen no reason to think that data-race freedom offers any leverage in this

03:04 <dh`> (and keep in mind that I taught undergrads concurrency for like 15 years)

03:04 <dh`> if anything it gives people a false sense of security

03:06 <companion_cube> So we also don't need memory safety cause it's not everything?

03:08 <dh`> pragmatically? no :-p

03:08 <companion_cube> Anecdotically, writing threaded code in rust is great

03:08 <companion_cube> And the same in ocaml5 has been footgunny for me

03:08 <companion_cube> Forgot locks, etc

03:08 <dh`> anyway what I'm arguing is not that languages should have unsafe concurrency, it's that data-race freedom isn't sufficient

03:09 <companion_cube> Nobody said it's the panacea

03:09 <dh`> the RESF would have you believe it is

03:09 <companion_cube> But given that barely any language provides even that

03:09 <companion_cube> No, they're careful to say it doesn't prevent all race conditions

03:09 <dh`> also twenty years of academic papers give that impression to the naive

03:10 <dh`> my conclusion is that you either want lightweight messages, transactions, or maybe both, and without these it's always going to be a shitshow

03:11 <dh`> the RESF is never careful to say anything :-)

03:11 <dh`> but the rust book is pretty disappointing on this point

03:11 <companion_cube> You could still make multiple transactions and be wrong, no?

03:12 <dh`> you can't make it impossible to write wrong code

03:12 <companion_cube> Hu. I remember this specific distinction being underlined, weird

03:12 <dh`> maybe it's been improved since I last looked at it

03:13 <dh`> (around a year ago I think)

03:13 <dh`> the version I saw didn't even mention the concept of condition variables

03:15 <dh`> but anyway, my vague ideas about parallel not-monads are in the same category, things you might be able to use to get some leverage on the real problem

03:17 <thizanne> data race freedom offers you some form of sequential consistency, and that's often a big deal to avoid writing incorrect concurrent programs already

03:17 <companion_cube> Yeah, you won't forget the lock or atomic

03:17 <dh`> sequential consistency is neither necessary nor sufficient

03:17 <thizanne> depends

03:18 <companion_cube> You might still split critical sections too much(?) but that's more rare an error

03:18 <thizanne> in theory, one can write correct programs in weakly consistent models

03:18 <thizanne> in practice, I've seen people try.

03:18 <thizanne> hoping that the average programmer does it on the average program is absolutely delusional in 2025

03:19 <companion_cube> What does that even look like?

03:19 <thizanne> madness, mostly

03:19 <dh`> normal code with locks

03:19 <thizanne> well if you have locks then you have no data races

03:19 <companion_cube> Normal code with locks, in rust, works just fine

03:19 <dh`> it's only the lock implementation that needs to care about the memory model, unless you're doing exotic things with lock-free data structures

03:20 <companion_cube> Oh yeah people do that for queues, too

03:20 <companion_cube> At least people who write schedulers and the likes

03:20 <dh`> if you are doing exotic things with lock-free data structures

03:20 <companion_cube> Or channels

03:20 <dh`> you probably care far too much about performance to want the system to provide you sequential consistency

03:20 <companion_cube> But there's a good book on atomics in rust

03:20 <thizanne> of course you don't want sequential consistency from the system, otherwise we'd just have it

03:21 <dh`> yes

03:21 <thizanne> but you also definitely want some form of it for the end programer

03:21 <companion_cube> Good news is, code can be reused and libraries exist

03:21 <thizanne> and most of the time, yes, that means using locks on everything that can have a data race, effectively "restoring" data race freedom

03:22 <dh`> so basically in the case where you might need to think specifically about data races, whether data-race freedom buys you back sequential consistency is irrelevant

03:22 <companion_cube> And Rust allows you to make sure that once you send a value in a channel, you can't access it anymore

03:22 <dh`> and in ordinary lock-based code it's irrelevant

03:22 <dh`> er, in ordinary lock-based code data-race freedom is irrelevant

03:22 <thizanne> my point is, ordinary lock-based code is ordinary precisely because data races are too hard to reason about

03:23 <companion_cube> No no no, in ordinary lock based code it's super important to help you *not forget the locks*

03:23 <dh`> no, ordinary lock-based code is ordinary because non-lock-based code is too hard

03:23 <thizanne> even if you have a correct semantics for those -- which last time I looked wasn't even the case in the C model

03:23 <dh`> data races are not the core of that hardness

03:23 <thizanne> it's too hard for several reasons, one of them being that data races are a mess

03:24 <dh`> you can make a lock-free queue or whatever entirely out of machine integers where data races are moot

03:24 <dh`> it's still hard

03:24 <companion_cube> Concurrent access to mutable data is the core of that hardness

03:24 <thizanne> even if you're only manipulating words, not having locks on a program running on a weak model means many things can happen that you won't have considered

03:24 <companion_cube> And most programs work on larger values than just words

03:24 <dh`> yes, and that's not about data races, it's about ordering considerations

03:24 <thizanne> thread { char x = 1; register char r0 = y } ;; thread { char y = 1; register char r1 = x }

03:25 <thizanne> that thing doesn't have lock and will sometimes end with r0 and r1 both being still equal to 0

03:25 <dh`> yeah yeah, any code like that is intentionally borrowing toruble

03:25 <thizanne> (assume x and y start at 0)

03:25 <thizanne> and it's because you have a data race

03:25 <dh`> nobody besides researchers in memory models writes code like that

03:25 <thizanne> you'd be surprised

03:26 <dh`> well

03:26 <dh`> ok, people write ill-advised code all the time

03:26 <thizanne> people everywhere believe that "if it fits in a word then it must be atomic"

03:26 <dh`> maybe I should say nobody who has any idea what they're doing writes code like that

03:26 <dh`> it _is_ atomic

03:26 <thizanne> well, depends on your definition of atomic

03:26 <dh`> atomic means the access is indivisble

03:27 <dh`> or indivisible

03:27 <dh`> which is still the case, however insane the cpu's ordering semantics are

03:27 <companion_cube> Causality is useful to reason about programs, isn't it

03:27 <companion_cube> thizanne: that's not even true on arm is it?

03:28 <thizanne> what if your store operation to an "atomic" location is not seen at the same time by different cores

03:28 <thizanne> is that still atomic?

03:28 <companion_cube> I know x86 is stronger than it has to

03:28 <dh`> yes

03:28 <dh`> it's atomic, but not consistent

03:28 <companion_cube> 😂 But no worries that's not useful to program

03:28 <dh`> or to be precise, the store and the reads that observe the store aren't mutually consistent

03:29 <dh`> a _sequence_ of operations may not be atomic

03:29 <thizanne> but then the store doesn't really happen once

03:29 <dh`> sure it does, it doesn't come undone in any cpu I've ever heard of

03:30 <thizanne> it doesn't come undone, but between the point where it's initiated and the point where it's completed, many things have happened

03:30 <dh`> that's about ordering of multiple operations, the store itself is still atomic

03:30 <thizanne> that's some definition of atomic, which isn't the only one

03:30 <dh`> that's the standard one

03:31 <thizanne> not really

03:31 <thizanne> the standard historical one kind of assumes sequential consistency, which makes it equivalent

03:32 <dh`> no? unless you're talking about atomicity of sequences of stores

03:32 <dh`> and you have to bring in isolation as well to really have a well-defined concept

03:33 <dh`> and we're back to transaction semantics

03:33 <thizanne> the atomic operations defined on modern (ie weak) memory models, as far as I'm aware, all imply that the operation completes atomically, ie for everyone

03:33 <dh`> I'm talking about atomicity as a concept

03:34 <dh`> and while many cpus define a total global ordering on atomic instructions, I don't think all do

03:35 <dh`> my recollection is that with riscv atomics you have to set the memory barrier bits to get that behavior

03:35 <dh`> (but it's been a while since I looked at the riscv spec and I might be confusing this with something else)

03:37 <dh`> and in any case you don't need that global ordering for correct execution of lock-based code

03:37 <thizanne> I don't really know that model but that's not my understanding from a quick look, my understanding is that here again atomic implies multi-process synchronised operations (as it should, because why would you ever have atomic word-sized operations otherwise)

03:38 <dh`> one reason to use same-cpu atomic operations is to synchronize with interrupt handling code

03:38 <thizanne> no of course you don't, because locked code is forcefully data race free, meaning that there is no weak consistent behaviour happening

03:39 <dh`> sure there is

03:39 <thizanne> no there isn't

03:39 <dh`> there are only barriers on entry and exit to critical sections

03:39 <thizanne> every weakly consistent model says you're SC if you're data race free

03:39 <dh`> within a critical section things can happen however

03:40 <thizanne> on one thread only

03:40 <thizanne> so there isn't even concurrent behaviours happening there

03:40 <thizanne> companion_cube | thizanne: that's not even true on arm is it?

03:40 <thizanne> you mean the word thing? I don't know

03:40 <dh`> so nobody can observe that store 2 reaches global visibility before store 1; that doesn't mean it doesn't happen

03:41 <dh`> anyway I think we're chopping logic rather than having a productive conversion :-(

03:41 <thizanne> who cares what happens in the silicon, we're talking program semantics

03:42 <thizanne> I guess a more pedantic way of saying this is, if your program is data race free then there is no valid trace that isn't SC-valid

03:42 <thizanne> whatever your underlying model is

03:42 <dh`> sure

03:42 <thizanne> (as long as it's weakly consistent)

03:42 terrorjack has quit [Quit: The Lounge - https://thelounge.chat]

03:42 <dh`> and what I was saying before is that this isn't much help in assessing whether your traces are correct relative to your intended program behavior

03:42 <thizanne> and my point is -- SC-valid traces are the only one that you can expect the average programmer to handle

03:43 <dh`> no, the average programmer shouldn't be doing any of this at all

03:43 <thizanne> meaning that even though your program only involves word-sized operations that you believe are atomic, then you usually need to lock them anyway

03:43 alfiee has joined #ocaml

03:43 <thizanne> well truth to be told, I agree that the average programmer shouldn't be doing any of concurrent programming

03:43 <dh`> yes, of course you do, because in general correctness demands consistency across multiple loads and stores

03:43 <thizanne> but I'm not sure that's a very reasonable expectation

03:44 <thizanne> (I don't think the average programmer should do programming, actually -- but well)

03:44 <dh`> the average programmer might be able to handle threaded code with locks, maybe

03:44 terrorjack has joined #ocaml

03:44 <dh`> but lock-free stuff? hell no

03:44 <thizanne> dh` | yes, of course you do, because in general correctness demands consistency across multiple loads and stores

03:45 <thizanne> yes, and the best way to achieve this is by enforcing data-race freedom, and that requires lock even if you think you don't have data races "because that's only words"

03:45 <dh`> no, because data-race freedom doesn't make your code correct.

03:45 <thizanne> it isn't sufficient

03:45 <dh`> thinking about consistency and isolation and atomicity does

03:45 <dh`> it's neither necessary nor sufficient

03:46 <thizanne> that's like saying "type systems aren't useful because they don't make your code correct"

03:46 <thizanne> it's not necessary theoretically, it is in practice because the alternative is unreasonable

03:46 <dh`> no, it's like saying Standard Pascal's type system isn't useful because it doesn't have a good tradeoff between making the code correct and being usable

03:47 <thizanne> you already said that reasoning about anything that isn't SC is out of reach of programmers

03:47 <dh`> there was a bunch of work on atomic blocks some years back that never really went anywhere, which was a pity

03:47 <dh`> yes, reasoning about SC is also out of reach

03:47 alfiee has quit [Ping timeout: 252 seconds]

03:47 <dh`> reasoning about transactions, however, is a lot more doable

03:47 <thizanne> reaching SC can then be done either by carefully inserting fences or the equivalent, which amounts to reasoning in your weak model (so unreasonable); or by locking everything to get data race freedom and thus SC

03:48 <dh`> and then the code is still wrong because SC is still too hard

03:48 <thizanne> that's another point

03:48 <dh`> that's the whole point

03:48 <thizanne> not really

03:48 <thizanne> of course programming is hard

03:48 <thizanne> and concurrent programming adds another difficulty

03:49 <thizanne> but at some point you need to accept some of that difficulty or you're never doing anything beyond hello world (and even then)

03:49 <dh`> data-race freedom is not a good tool for assessing whether your concurrent code is correct

03:49 <dh`> is that a better way of putting it?

03:49 <thizanne> ...yes but that's not its role

03:50 <thizanne> not having data-race freedom is a recipe for incorrect code

03:50 <dh`> only in the general case

03:50 <thizanne> by contraposition, data race freedom is a required tool to make correct code. As most of these tools are, it won't be enough

03:50 <dh`> there are quite a few special cases where that's not true

03:50 <dh`> and that's one of the problems with the whole thing

03:50 <thizanne> yes I think we agreed on that

03:50 <dh`> stuff like statistics counters, for example

03:51 <dh`> you can do an unlocked increment and once in a while it'll bobble one and you probably don't care

03:51 <thizanne> not having data-race freedom is a recipe for incorrect code unless you know how to code in weak memory models, which is probably < 100 people in the world

03:51 <companion_cube> And there it's fine to use an atomic even in application code

03:51 <dh`> but oh now it's UB in C11 because C11 is silly

03:51 <companion_cube> But it's opt-in

03:52 <dh`> that's _not_ an atomic, is the point

03:52 <companion_cube> I'd use an atomic, why wouldn't I?

03:52 <companion_cube> Just with a chill ordering

03:52 <dh`> because the unlocked increment doesn't do cache synchronization and so it'll be noticeably faster when contended?

03:53 <dh`> this stuff unfortunately matters in real highly concurrent code like kernels

03:53 <companion_cube> Depends on the architecture no?

03:53 <dh`> not much

03:53 <companion_cube> I thought on x86 it'd be atomic anyway

03:53 <dh`> only if you do LOCK INC or whatever it is

03:53 <companion_cube> Although maybe not if there's multiple cups?

03:53 <thizanne> your point kind of amounts to "programmers that don't need correctness don't need the tools to write correct programs"

03:53 <dh`> you get a total store ordering, that doesn't prevent READ READ WRITE WRITE

03:53 <companion_cube> CPUs*

03:53 <thizanne> which is undubitably true I guess

03:54 <dh`> sort of

03:54 <companion_cube> I'll let the experts opt in into doing imprecise but fast things

03:54 <dh`> sure

03:55 <companion_cube> I personally would rather use an atomic and know there's a good semantic

03:55 <dh`> they will appreciate it if you don't go gratuitously make their code undefined

03:55 <companion_cube> They can always write a bit of asm ig?

03:55 <dh`> sometimes

03:56 <dh`> my feeling is that lock-free stuff should in fact be written in asm

03:56 <companion_cube> Maybe not everyone is a kernel dev or a concurrency expert, anyway

03:56 <dh`> no

03:56 <companion_cube> I'd rather get help from the language to tell me I did an unprotected read or write on shared data

03:56 <dh`> which gets back to my other point, which is that ordinary folks much prefer transaction semantics when they can get it (once they understand what it's all about anyway)

03:57 <companion_cube> OCaml doesn't (yet)

03:57 <companion_cube> Maybe, but isn't it harder to reason about perf? And your code needs to be able to rerun?

03:57 <companion_cube> If we're talking STM

03:57 <thizanne> lock free stuff can be written in any language where you have an actual semantics for data races

03:57 <dh`> every STM I've seen is crap :-(

03:58 <thizanne> C has (a broken and unsound) one, why not use it if that's your codebase language

03:58 <companion_cube> But then is there an example of what you'd actually like?

03:58 <dh`> they are all trying to magically infer the granularity of locking, and that fundamentally doesn't work and never will, so they have huge overheads

03:58 <thizanne> I guess that's what static analysers are for companion_cube

03:58 <dh`> we had a memory transaction system with 5-10% overhead back in the 90s

03:59 <companion_cube> Cause from what I see there's either rust (shared memory but manageable) or erlang (share nothing)

03:59 <companion_cube> And the rest is very error prone

03:59 <dh`> the transaction stuff was open-coded in C because that was what we had

03:59 <dh`> there is not, I rant these rants to try to get people to think broadly

04:00 <dh`> in the hopes that someone will make something better sometime

04:00 <companion_cube> I'm not a researcher in that so I'm mostly interested in the existing stuff

04:00 <companion_cube> JST's fork of ocaml tries to catch up with rust and that's exciting, for example

04:00 <dh`> lightweight channels with shared-nothing are the best model, I'm pretty sure

04:01 <dh`> but erlang itself has a lot of issues

04:01 <dh`> golang also has channels, but it unfortunately also has issues

04:01 <companion_cube> Rust (+ tokio) also has channels :)

04:02 mange has quit [Remote host closed the connection]

04:02 <dh`> rust has a different pile of issues :-)

04:02 <companion_cube> (and issues I suppose. But async rust pushes towards channels for some reason)

04:02 <dh`> however, it's definitely possible to have strong transaction semantics without it being ruinously expensive

04:03 <dh`> even with redo/undo

04:03 <dh`> the problem with this model is that actions you can't undo have to be deferrable to transaction commit time

04:03 <dh`> which is mostly ok but sometimes problematic

04:05 <companion_cube> I think Vesa has something like that? Lightweight transactions

04:06 <dh`> I don't know, I haven't gone and caught up with the literature for a while

04:06 <dh`> the system I referred to above got published in 1996, I did more work later but that ended up being a casualty of the 2008 funding environment

04:08 <dh`> google scholar doesn't recognize that reference, alas

04:09 <dh`> also there's an additional set of unanswered questions about transaction nesting and abstraction boundaries

04:09 <dh`> I'm reasonably certain I understand how it _should_ work but there's a good solid year or two of work to write it all down coherently

04:10 <dh`> which is probably never going to happen at this point :-|

04:11 <dh`> maybe I should look at implementing this kind of transaction system in ocaml

04:12 <dh`> last thing I need is another project that size :-(

04:14 <companion_cube> Well look at Vesa's maybe, his gh account is polytypic

04:14 <companion_cube> Iirc the project is kcas

04:15 <dh`> oh, thought that was a project name :-/

04:17 <dh`> found it

04:17 <dh`> my initial guess based on reading the description is that it's probably still ruinously expensive, just not as bad as some of the early STM attempts

04:17 <dh`> (many of which were things like "orders of magnitude slower")

04:18 <dh`> but that's only a guess, will need to look more

04:18 <dh`> and it's getting late so I need to disappear :-(

04:18 <dh`> feel free to prod me about this more later if you want

04:30 alfiee has joined #ocaml

04:34 alfiee has quit [Ping timeout: 252 seconds]

05:15 <discocaml> <diligentclerk> @companion cube I looked up IO in Lean and it says: "IO action transforms the whole world. IO actions are actually pure, because they receive a unique world as an argument and then return the changed world. This perspective is an interior one that matches how IO is represented inside of Lean. The world is represented in Lean as a token, and the IO monad is structured to make sure that each token is used exactly once."

05:15 <discocaml> <diligentclerk> Another quote from the "Functional Programming in Lean" book is "Writing a line of text to standard output is a pure function, because the world that the function returns is different from the one that it began with. Programs do need to be careful to never re-use the world, nor to fail to return a new world—this would amount to time travel or the end of the world, after all. Careful abstraction boundaries can make this style of program

05:16 <discocaml> <diligentclerk> It reads to me like Lean has a linear type system for tracking effects?

05:16 alfiee has joined #ocaml

05:20 <discocaml> <diligentclerk> You asked "Even if you want an IO monad, why would you ever make it lazy?" so I am wondering if there is a practical way to implement an IO monad which is not lazy in OCaml. This paragraph makes it sound like Lean uses linear types to have a strict IO monad, and AFAIK there's no way to implement linear types in OCaml

05:21 alfiee has quit [Ping timeout: 268 seconds]

06:02 alfiee has joined #ocaml

06:06 alfiee has quit [Ping timeout: 260 seconds]

06:16 ski has joined #ocaml

06:47 alfiee has joined #ocaml

06:52 alfiee has quit [Ping timeout: 260 seconds]

07:33 alfiee has joined #ocaml

07:36 Serpent7776 has joined #ocaml

07:38 alfiee has quit [Ping timeout: 276 seconds]

08:05 bartholin has joined #ocaml

08:12 alexherbo2 has joined #ocaml

08:18 alfiee has joined #ocaml

08:23 alfiee has quit [Ping timeout: 252 seconds]

08:28 Serpent7776 has quit [Ping timeout: 244 seconds]

08:44 Haudegen has joined #ocaml

08:53 Serpent7776 has joined #ocaml

09:04 alfiee has joined #ocaml

09:08 alfiee has quit [Ping timeout: 246 seconds]

09:16 Everything has quit [Ping timeout: 276 seconds]

09:25 mange has joined #ocaml

09:30 alexherbo2 has quit [Remote host closed the connection]

09:49 bartholin has quit [Quit: Leaving]

09:50 alfiee has joined #ocaml

09:55 alfiee has quit [Ping timeout: 276 seconds]

10:15 alexherbo2 has joined #ocaml

10:36 alfiee has joined #ocaml

10:37 alexherbo2 has quit [Remote host closed the connection]

10:40 alfiee has quit [Ping timeout: 252 seconds]

11:19 szkl has joined #ocaml

11:22 alfiee has joined #ocaml

11:26 alfiee has quit [Ping timeout: 248 seconds]

11:44 alexherbo2 has joined #ocaml

11:47 <discocaml> <uberpyro181> I'm fairly sure that as long as there's no way to escape an IO action, e.g. some function with a type like `'a io -> 'a`, then linearity (really uniqueness) is enforced on every reference to an io object

11:47 <discocaml> <uberpyro181> e.g. the cps style enforces uniqueness on every reference

11:47 <discocaml> <uberpyro181> so with the monad-like handling of io actions there's no need to have linear types on top of that, linear types allow you to safely unwrap the actions

11:48 <discocaml> <uberpyro181> well, even with linear types I think you need to use a callback style, you need essentially unique types to get rid of that

11:48 <discocaml> <uberpyro181> well, even with linear types I think you need to use a callback style, you need "essentially unique types" as in Clean to get rid of that

11:57 <discocaml> <gooby_diatonic> Very profound philosophy from the Lean manual

12:00 alexherbo2 has quit [Remote host closed the connection]

12:07 alfiee has joined #ocaml

12:12 alfiee has quit [Ping timeout: 272 seconds]

12:36 <discocaml> <_4ad> it just uses monads.

12:37 <discocaml> <_4ad> there are no linear types in Lean.

12:38 <companion_cube> @diligentclerk I find it a bit solipsistic tbh

12:39 <companion_cube> and I sketched a way to do it in OCaml without lazyness

12:39 mange has quit [Quit: Zzz...]

12:53 alfiee has joined #ocaml

12:57 alfiee has quit [Ping timeout: 252 seconds]

13:10 Haudegen has quit [Quit: Bin weg.]

13:13 ygrek has joined #ocaml

13:31 Putonlalla has quit [Ping timeout: 252 seconds]

13:40 alfiee has joined #ocaml

13:44 alfiee has quit [Ping timeout: 246 seconds]

13:55 Putonlalla has joined #ocaml

13:59 kurfen has quit [Ping timeout: 246 seconds]

14:00 kurfen has joined #ocaml

14:05 nirvdrum74 has quit [Quit: Ping timeout (120 seconds)]

14:05 nirvdrum74 has joined #ocaml

14:11 ygrek has quit [Ping timeout: 264 seconds]

14:13 ygrek has joined #ocaml

14:19 semarie has quit [Ping timeout: 248 seconds]

14:26 alfiee has joined #ocaml

14:30 alfiee has quit [Ping timeout: 244 seconds]

14:34 Haudegen has joined #ocaml

14:34 gentauro has quit [Read error: Connection reset by peer]

14:40 gentauro has joined #ocaml

14:41 semarie has joined #ocaml

15:13 alfiee has joined #ocaml

15:18 alfiee has quit [Ping timeout: 272 seconds]

15:50 steenuil has quit [Remote host closed the connection]

15:51 steenuil has joined #ocaml

15:54 steenuil has quit [Remote host closed the connection]

15:54 steenuil has joined #ocaml

15:58 alfiee has joined #ocaml

16:03 alfiee has quit [Ping timeout: 265 seconds]

16:31 wickedshell has quit [Ping timeout: 248 seconds]

16:33 <discocaml> <diligentclerk> I don't understand what this means, I truly don't understand what a strict IO monad looks like or how one might implement one. Let's say that I have a function `f` that takes a string, prints it as a message to the user, and gets an integer input in response, so it's of type `string -> int IO.t`. Now I define `g (s : string) = ignore (let %x = f s in x); 4`. The type of `g` should be `string -> int`. If the monad is lazy then since the m

16:35 <discocaml> <_4ad> strictness has nothing to do with it, you can have monads in a strict language (like OCaml).

16:35 <discocaml> <_4ad> monads just sequentialize computations, it doesn't matter whether the language is string or not.

16:36 <discocaml> <diligentclerk> I think you're misunderstanding my question because that response doesn't make any sense

16:36 Haudegen has quit [Quit: Bin weg.]

16:37 <discocaml> <diligentclerk> I understand and agree with everything you just said, but it doesn't answer my question

16:37 <discocaml> <contificate> > monads just sequentialize computations

16:37 <discocaml> <contificate> monads taking the credit for what CPS gives you

16:38 <discocaml> <diligentclerk> I was looking into cool CPS applications last night after you brought it up and naturally Oleg Kiselyov has a page on it

16:38 <discocaml> <contificate> yeah, Oleg is a CPS, shift/reset, etc. chad

16:38 <discocaml> <contificate> the coolest application of CPS is probably in defunctionalising CPS to derive abstract interpreters, but trampolining, schedulers, being the basis of monadic style, faithful compiler lowering, relation to SSA, etc. is neat as well

16:39 <discocaml> <_4ad> and btw, saying that Lean is strict is not quite right. for a total language (like Lean), call by name (need) vs. call by value makes no semantic difference whatsoever, they are exactly the same (it can make a difference in performance or whatever).

16:41 <discocaml> <diligentclerk> Interesting. I would have thought there was a semantic difference when it comes to an expression like (lambda x : unit.4)(print_endline "Hello, world")

16:43 <discocaml> <diligentclerk> I would describe the "call by name" semantics of this as having no side effect, and the "call by value" semantics as having a side effect

16:45 alfiee has joined #ocaml

16:49 <companion_cube> afaik Lean compile to call by value, is what I meaan

16:49 <companion_cube> mean*

16:49 <discocaml> <eval.apply> it's not about lazy vs strict, the side effect wouldn't happen on it's own you, and you can't just "ignore" the monad wrapper, bind is `'a t -> ('b -> 'a t) -> 'a t` you would have a "runIO" that interprets the IO actions in the monad you built up

16:49 alfiee has quit [Ping timeout: 272 seconds]

16:49 <companion_cube> exactly, if you ignore the IO value nothing happens

16:50 <discocaml> <eval.apply> now if you want that interpreter to execute in-place as if you were shooting off rockets in a normal imperative way, you can do that if your compiler is smart enough

16:50 <discocaml> <eval.apply> or if you have linear types enforce it and just make that how it works

16:50 <discocaml> <eval.apply> but that's an optimization

16:55 <discocaml> <eval.apply> now do i edit and face the wrath of the IRC users or do i wait until someone decapitates me for the typo in that bind type

16:57 <discocaml> <contificate> they'll say nothing 🔫

17:05 <discocaml> <barconstruction> I think what companion cube just said is helpful, because I might have been misunderstanding what lazy/strict means in this context.

17:08 <discocaml> <barconstruction> By "strict IO monad" I meant that the side effect associated to an IO value is performed at the time that the IO value is defined.

17:11 <dh`> that is what I'd been thinking as well

17:11 <discocaml> <contificate> does such a thing even exist

17:12 <discocaml> <contificate> isn't the IO monad always backed by some kind of opaque state transfromer

17:12 <discocaml> <contificate> isn't the IO monad always backed by some kind of opaque state transformer

17:14 <dh`> it doesn't have to be

17:15 <companion_cube> @barconstruction it's not monadic then

17:16 <companion_cube> for me an IO monad _must_ only do IOs when it goes through the runIO interpreter

17:16 <companion_cube> otherwise it's just lwt-style hot promises

17:17 <companion_cube> hmmmI guess this is terminology from observables

17:18 <dh`> yeah, I was about to say there's a problem if a loose 't IO causes something to happen without being bound

17:18 <dh`> violates the sequencing axioms

17:19 <companion_cube> and also makse `repeat 5 some_io` not actually repeat the IO

17:19 <companion_cube> which is … not great

17:26 <discocaml> <barconstruction> That makes sense.

17:27 <dh`> for the same reason you also can't just have bind execute the effects

17:28 <discocaml> <barconstruction> Right.

17:29 <dh`> banning the existence of values of type 'a IO might make it work, but that would also get real weird real fast

17:30 alfiee has joined #ocaml

17:33 <discocaml> <barconstruction> so `'a Lazy.t` is a monad, right? I can define `return x = Lazy.from_thunk (fun ()->x)` and I can define `bind x f = Lazy.from_thunk (fun () -> let a = Lazy.force x in Lazy.force (f a)`

17:34 <discocaml> <barconstruction> I would call this a call by value evaluation order. So is this a strict monad

17:34 alfiee has quit [Ping timeout: 244 seconds]

17:43 bartholin has joined #ocaml

17:48 Haudegen has joined #ocaml

17:55 euphores has quit [Quit: Leaving.]

18:00 euphores has joined #ocaml

18:15 alfiee has joined #ocaml

18:20 alfiee has quit [Ping timeout: 248 seconds]

18:25 dylanj_ has joined #ocaml

18:29 dylanj has quit [Ping timeout: 248 seconds]

18:29 dylanj_ is now known as dylanj

18:37 wickedshell has joined #ocaml

19:02 alfiee has joined #ocaml

19:05 dylanj has quit [Remote host closed the connection]

19:06 dylanj has joined #ocaml

19:06 alfiee has quit [Ping timeout: 248 seconds]

19:23 Haudegen has quit [Quit: No Ping reply in 180 seconds.]

19:24 Haudegen has joined #ocaml

19:48 alfiee has joined #ocaml

19:52 alfiee has quit [Ping timeout: 252 seconds]

20:34 alfiee has joined #ocaml

20:38 alfiee has quit [Ping timeout: 252 seconds]

20:47 Anarchos has joined #ocaml

21:16 YuGiOhJCJ has joined #ocaml

21:21 alfiee has joined #ocaml

21:21 cawfee has quit [Ping timeout: 244 seconds]

21:22 ski has quit [Ping timeout: 260 seconds]

21:25 cawfee has joined #ocaml

21:25 alfiee has quit [Ping timeout: 260 seconds]

22:07 alfiee has joined #ocaml

22:11 alfiee has quit [Ping timeout: 252 seconds]

22:11 mange has joined #ocaml

22:12 ygrek has quit [Remote host closed the connection]

22:13 Tuplanolla has joined #ocaml

22:23 Serpent7776 has quit [Ping timeout: 252 seconds]

22:24 bartholin has quit [Quit: Leaving]

22:28 rgrinberg has joined #ocaml

22:41 exfalsoquodlibet has joined #ocaml

22:52 alfiee has joined #ocaml

22:56 Anarchos has quit [Quit: Vision[]: i've been blurred!]

22:57 alfiee has quit [Ping timeout: 252 seconds]

23:39 alfiee has joined #ocaml

23:43 alfiee has quit [Ping timeout: 252 seconds]

23:44 rgrinberg has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

23:48 rgrinberg has joined #ocaml

23:53 rgrinberg has quit [Ping timeout: 244 seconds]

23:55 ygrek has joined #ocaml