Carmen

Data Scientist in Vietnam.

Word Embeddings – blessing or curse in disguise?

As word embeddings become more and more ubiquitous in language applications, a key issue has likewise emerged. The ability of embeddings to learn complex, underlying relationships between words is also their greatest caveat: How do we know when we have trained a good embedding? It’s important to differentiate between a good embedding in a more general sense and …

Word Embeddings – blessing or curse in disguise? Read More »

Bias in Data Science – the Good, the Bad and the Avoidable !?

In recent years, there have been a few prominent examples of accidental bias in machine-learning applications, such as smartphones’ beauty filters (that essentially ended up whitening skin) [1] or Microsoft’s from-innocent-teen-to-racist-in-24-hours chatbot [2,3]. Examples such as these fell victim to inherently biased data being fed into algorithms too complex to allow for much transparency. Hidden bias continues …

Bias in Data Science – the Good, the Bad and the Avoidable !? Read More »