Lobsters Interview with Claudius

@Claudius maintains LispE and previously TAMGU at Naver, combining array and logic programming with Haskell features. (N.b. the wiki holds the documentation and articles.)

In this interview, we discuss Lisp and Prolog implementations, array languages, symbolic (GOFAI) and neuro-symbolic AI.


How did you discover programming, come to pursue a PhD etc.?

It's not exactly a recent…

Read more →
How can i build a High Quality dataset?

Since this is already quite concrete, I looked into it directly:


Short answer

Yes, your proposed high-level order is reasonable:

CPT -> SFT -> small final hand-written / polished SFT

But I would slightly modify it:

CPT
  -> small hand-written seed + private eval
  -> synthetic / generated SFT from cleaned raw text
  -> filtering + verification
  -> SFT

Read more →
Page 1