Comment on 'It's Literally the Gulag': Furious Meta Employees Speak Out on 'Soul-Crushing' AI Jobs
gravitas_deficiency@sh.itjust.works 4 days agoI’m personally playing with the idea of putting together a POC to transition our “foundation” data warehouse from PSQL to a graphDB, because the extensibility and maintainability of our current system is fucking awful. Like, some upstream entity gets a version bump and there’s like 5 systems we have to go through and add columns to various tables and occasionally fuck around with joins and so on every single time there’s a new piece of data we want to integrate. And we have no capability to scan back historically and evaluate our holistic state at some particular time index, which can be really helpful for some applications.
Anyways, I’m fucking swamped at work so haven’t touched that at all, but I’ve wanted to explore that idea for well over a year and a half at this point.
flying_sheep@lemmy.ml 4 days ago
I have little experience with graphdb, but a lot of experience with the pain you’re describing. Maintaining schemas is a pain, maybe if you don’t need the performance, you can go that route!
gravitas_deficiency@sh.itjust.works 4 days ago
The thing that interests me about it is that it will be a lot more trivially interrogable by ML stuff (bespoke ML specifically, not LLM), which could glean an absolute shitload of interesting insights for us.
I am an enormous fucking Luddite for a whole swath of reasons when it comes to LLMs, but ML outside of that context can be immensity powerful when employed correctly.
flying_sheep@lemmy.ml 3 days ago
For sure, my lab has been doing that for a long time.
How is graphdb more ML-friendly?
gravitas_deficiency@sh.itjust.works 3 days ago
If you’re doing PSQL (or any typical relational DB flavor), there’s a lot more complexity in terms of understanding the shape of the data, what joins to what, how to optimize queries, etc. Graph DBs are gonna be easier for a model to explore, since they can just do stuff like “I want to see tests with samples that have reactivity to mutation ABC on chromosome 14 over a threshold of X”, which is a lot easier for an ML agent (or less experienced developer, or even a molecular biologist with limited CS/DB experience) to just intuitively evaluate correctly using the syntax of GraphQL than it would be trying to do a shitload of joins between 6 or 7 tables in PSQL.