Blog
Catch up on our real-world impact explorations and company updates.

Press Release: TDCX Group acquires SUPA to supercharge AI-enablement platform Chemin
Acquisition strengthens Chemin’s capabilities in complex AI data services amid global demand surge.

Testing AI Hiring: What Reasoning Patterns Reveal About Bias in Southeast Asia
Would an AI model make the same hiring decision if the only thing that changed was the candidate’s ethnicity?

Evaluating LLMs’ Reasoning Ability: A Connect 4 Showdown
Given the recent interest in analyzing LLMs’ reasoning abilities, we conducted tests on several LLM models by having them play Connect 4 to evaluate their reasoning ability.

AI Safety and Governance: Why It Matters More Than Ever
AI safety isn’t just a technical issue; it’s a societal one, affecting trust, governance, and the public good.

Benchmarking Bahasa Indonesia LLMs: SEA-LIONv3 vs SahabatAI-v1
In Round 2 of our LLM evaluation, we compared Model A (SEA-LIONv3) and Model B (SahabatAI-v1) to assess their performance on Bahasa Indonesia tasks.

Chemin's Advent of Code (Expanded, Curated, Verified) Dataset for Code Generation Evaluation and Training
Discover how Chemin’s Advent of Code (Expanded, Curated, Verified) Dataset accelerates code generation research.

Strengthening AI Governance in Southeast Asia
Southeast Asia is rapidly adopting Generative AI (GenAI) and Large Language Models (LLMs), unlocking new business opportunities while introducing new risks.

Preparing Code Eval Datasets: Data Cleaning and Automated Code Execution for Advent of Code with Docker and Python
This blog outlines a system for processing Advent of Code submissions written in various languages.

DeepSeek R1 Crushes Advent of Code 2024: Our Latest Code Benchmark
Large Language Models (LLMs) for code generation have taken the software development world by storm, offering automated solutions to complex coding challenges.

Chemin's Bilingual Dataset for Evaluating Reasoning Skills in STEM Subjects
Fresh out of the oven! Our team just released a bilingual multimodal dataset for evaluating reasoning skills in STEM Subjects.

Local vs Global: Testing GPT-4o-mini and SEA-LIONv3 on Bahasa Indonesia
We tested two Large Language Models (LLMs), GPT-4o-mini and SEA-LIONv3, on their handling of Indonesian-specific questions.

Press Release: TDCX and SUPA tie-up to help companies address a key barrier in generative AI adoption
Collaboration provides companies with a one-stop-solution for their data labeling needs.

Chemin Achieves SOC 2 Type II Certification: A New Level of Trust and Security for Our Customers
We are thrilled to announce that Chemin (formerly known as SUPA) has achieved SOC 2 Type II certification.

Press Release: Global Leader in AI Waste Intelligence Greyparrot.ai Expands to 89 Waste Categories, Powered by SUPA
Under every AI (Artificial Intelligence) model is a solid foundation of high-quality labeled data.

What Is Data Labeling? A Comprehensive Guide
Without properly labeled data, ML models struggle to understand key features, leading to unreliable results. This guide explores what data labeling is, how it works, different approaches, best practices, and its real-world applications.

What Happens When AI is Trained on Its Own Content?
Recent advances in generative AI image, text, and audio generation seem to promise endless potential. It’s gotten to the point where researchers are exploring the use of synthetic data generated by AI to train next generation models.

If You Want Data Annotation Done Well, Build The Right Workforce
Quality data annotation begins with building the right workforce. Building an effective, motivated workforce, however, is no easy feat—especially when annotators are often crowdsourced & technically not full-time employees.
Turn ideas into true innovation
Let your AI take flight with our proven methods to establish credible data, reliable models, and stay on par with industry moves.