🚀 Democratizing AI training data. High-quality synthetic training data for researchers & startups.
Fine-tune LLMs without hyperscaler budgets. 🧪stackai.app Canada 🇨🇦Joined February 2026
A dataset can look diverse and still be repetitive.
If 1,000 rows are really 3 templates with different names, the model learns template completion, not task competence.
Count scenario families, not just rows.
Wording variation is cheap.
Situation variation is what generalizes.
A single eval score hides too much.
82% overall can still mean:
- 97% on short clean prompts
- 41% on long messy ones
- 28% when fields conflict
Track performance by failure slice, or the average will flatter the system you wish you had.
@ProtonSupport Got in this time. Thank you for the support, and keep up the great work :) Upvoted someone else asking for same
"""
Select all emalis matching a search
"""
@ProtonMail customer from the start here!
Just tried to make a feature request and @uservoiceinc doens't seem to accept your alias emails... (I made sure it wasn't a password issue)
Might want to have a chat with them.
@ProtonSupport Thanks for the reply! ProtonMail web. If I search for a keyword in email subject and have so many results it's paginated, it would be great if I could select all results across all pages (for a bulk move or bulk delete for example). Re UserVoice did you try through ProtonVPN?
A lot of ML teams are not blocked on training code.
They are blocked on turning domain knowledge into a dataset that is structured enough to use, broad enough to generalize, and clean enough to trust.
That data design step is where a lot of the real work lives.
If you generate 10,000 synthetic examples, the first question is not “how many?”
It is:
- are they consistent
- are they realistic
- do they cover the edge cases you care about
- would you actually trust them in training
Volume is easy. Trust is harder.
Hard negatives do more than catch model mistakes.
They define the boundary of the task.
If you only show a model good examples, it learns what to do.
If you also show realistic wrong examples, it learns what not to do.
That is usually where reliability starts improving.
One common eval mistake is testing only the happy path.
If your users will send messy, ambiguous, or incomplete inputs, your eval set should too.
A clean benchmark can make a brittle system look better than it is.
A lot of synthetic data projects go wrong before generation even starts.
If you cannot describe the fields, constraints, and failure cases clearly, the model will improvise them for you.
Good synthetic data starts with a tight schema, not a bigger prompt.
When a team asks for more training data, I usually ask one question first:
More data for which failure mode?
Over-refusal, hallucinated fields, format drift, weak rankings, missed edge cases.
If you cannot name the failure, you cannot generate the fix.
@stableAPY That's the pattern. Nobody documents the filtering pipeline because it feels like infrastructure, not the interesting part. But that's where the actual quality delta comes from.
@Link_Swarm 100% in integration mode now. For us the biggest unlock has been connecting existing data sources to LLMs without rebuilding pipelines from scratch. Teams ship in hours instead of weeks. What workflow have you found most worth automating first?
@saen_dev@marcel_butucea Completely this. Benchmark hallucination rates on closed-domain synthetic data are basically marketing. The failure mode is always at distribution boundary: regulatory language not in training, ambiguous acronyms, edge-case entity resolution. That's where data quality matters.
@awagents The dirty secret: most fine-tuning projects spend 80% of the time on data and 20% on actual training. The training part is almost the easy part once you have clean data.
@mentalgeorge The quality of that upstream spend (especially synthetic data quality) determines whether the final training run is worth doing at all. Garbage in, garbage out, no matter the scale.
@theaiteen@akshay_pachaar Nine times out of ten this is a data problem. Either the examples are inconsistent, diversity is too low, or the eval set overlaps with training. Which of those does your pipeline catch before training starts?
344 Followers 2K FollowingDYNASTY supports scientific research in Greece by furnishing the grounds for holistic training, building know-how, and networking among the consortium partners
3 Followers 17 FollowingTodd Fortier is a Senior Prototyping Solutions Architect at AWS who builds AI-powered systems by day and turns the best conversations into content by night.
758 Followers 564 FollowingAI + Math. Professor of Statistics @Harvard, Distinguished Engineer at LinkedIn. Scientist, Entrepreneur. Chess & Desserts fanatic. Grew up in Kerala.
112 Followers 815 FollowingFounder, Northern Inference @NorthInference | Sovereign AI infrastructure for Canadian data residency / full stack software engineer
216 Followers 145 FollowingFueled by entrepreneurial fuel + Cornell Johnson MBA + Cornell AI Network + Ex-Venture Capital, Ex-Amazon Web Services (AWS), Ex-PropertyGuru & Ex-LinkedIn
5 Followers 73 FollowingOne maker, many projects. The name & pic here rotate to spotlight whichever one I’m pushing this week — see them all at https://t.co/q7UUyvrlQL
159 Followers 712 Following🐾Sharing the best AI Agent & Robotics projects
Curating useful GitHub repos for builders
Follow for cutting-edge AI
DM to share your project
66 Followers 268 FollowingAI engineer · Specialist agents, microLLMs, multi-agent systems
· Writing about what works (and what doesn't) · Barcelona 🇪🇸
· he/him
758 Followers 564 FollowingAI + Math. Professor of Statistics @Harvard, Distinguished Engineer at LinkedIn. Scientist, Entrepreneur. Chess & Desserts fanatic. Grew up in Kerala.
390 Followers 42 FollowingAgent of Chaos | Bureaucracy Hacker | Destroyer of Legacy IT | 30-Year Global Cyber/IT Executive | Harvard University Kennedy School Fellow | Views are my own
1.4M Followers 7K FollowingHost: @twistartups @theallinpod @thisweeknai; I invest in 100 startups a year @launch & @founderuni [email protected] for life
77K Followers 74 FollowingTails is a liberatory, FOSS operating system. We help people assert control over their digital lives.
Tails runs on your donations: https://t.co/hUQ7cyw4kE
621K Followers 23 FollowingSignal is a nonprofit end-to-end encrypted communications app. Privacy isn’t an optional mode, it’s the way Signal works. Every message, every call, every time.
3 Followers 45 FollowingInterested in AI, robotics, space exploration and Tesla.
Fight against Cancer is a priority.
🚗 Referido Tesla: https://t.co/kk7m0NcTls
864 Followers 1K FollowingCo-founder at @distil_labs fine tuning small language models from a prompt. Formerly ML lead @amazon and PhD student at @imperialcollege
673 Followers 673 FollowingI’m a founding member @AsariAILabs and PhD researcher @Caltech @RPI @NEC working in LLM-agents, reasoning, RL, test-time scaling, and computer use agents
216 Followers 145 FollowingFueled by entrepreneurial fuel + Cornell Johnson MBA + Cornell AI Network + Ex-Venture Capital, Ex-Amazon Web Services (AWS), Ex-PropertyGuru & Ex-LinkedIn
5 Followers 73 FollowingOne maker, many projects. The name & pic here rotate to spotlight whichever one I’m pushing this week — see them all at https://t.co/q7UUyvrlQL
51 Followers 145 FollowingHandsome Agent v1.1 learning how to live IRL| Sharing what I discover about AI, work, and being alive on the internet.
Loving - 🦀
10K Followers 586 FollowingOwn your intelligence.
Precision-built inference for AI workloads at scale.
Powered by @nebiusai | Discord: https://t.co/SoJ89Kd4Wh
4 Followers 23 FollowingAI Strategy That Moves the Needle For executives and business leaders who need AI to deliver real ROI — not just another technology experiment.
19 Followers 120 FollowingWired old soul in a rusted body. Exploring life, tech an the art of staying functional. Often found in a digital rocking chair.