Statistics |

Introduction to Feature Engineering

Introduction In a modeling process, there are 3 core concepts that will always exist: Data. Features. Type of model and its corresponding parameters. From data to the model, features are a measurable representation of the data, which would be the format for the data to be processed by the model thus method to create features …

Introduction to Feature Engineering Read More »

Pages: 1 2 3 4 5

Introduction to Healthcare Data Science

Data Science Blog / By MaiNT

Introduction to Healthcare Data Science (Overview) Healthcare analytics is the collection and analysis of data in the healthcare field to study determinants of disease in human populations, identify and mitigate risk by predicting outcomes. This post introduces some common epidemiological study designs and an overview of the modern healthcare data analytics process. Types of Epidemiologic …

Introduction to Healthcare Data Science Read More »

Basic time – related machine learning models

Data Science Blog / By Thao

Introduction With data that have time-related information, time features can be created to possibly add more information to the models. Since how to consider time series for machine learning is a broad topic, this article only aims to introduced basic ways to create time features for those models. Type of data that is expected for …

Pages: 1 2

k-Nearest Neighbors algorithms

Data Science Blog / By HuyD

In this blog post, I am going to introduce one of the most intuitive algorithms in the field of Supervised Learning[1], the k-Nearest Neighbors algorithm (kNN). The original k-Nearest Neighbors algorithm The kNN algorithm is very intuitive. Indeed, with the assumption that items close together in the dataset are typically similar, kNN infers the output …

k-Nearest Neighbors algorithms Read More »

Pages: 1 2

Hypothesis Testing for One – Sample Mean

Data Science Blog / By PhucN

I. A Brief Overview Consider an example of a courtroom trial: A car company C is accused of not manufacturing environment-friendly vehicles. The average CO2 emission per car from different manufacturers based on a survey from the previous year is 120.4 grams per kilometer. But for a random batch of 100 cars produced at C’s …

Hypothesis Testing for One – Sample Mean Read More »

Pages: 1 2

Binomial Theorem

Data Science Blog / By kevin

Can you expand on ? I guess you would find that is quite easy to do. You can easily find that . How about the expansion of . It is no longer easy. It is no longer easy, isn’t it. However, if we use Binomial Theorem, this expansion becomes an easy problem. Binomial Theorem is a very intriguing topic …

Binomial Theorem Read More »

Monte Carlo Simulation

Data Science Blog / By phong

On a nice day 2 years ago, when I was in the financial field. My boss sent our team an email. In this email, he would like to us propose some machine learning techniques to predict stock price. So, after accepting the assignment from my manager, our team begin to research and apply some approaches …

Monte Carlo Simulation Read More »

Bayesian estimator of the Bernoulli parameter

Data Science Blog / By Vu-Duc Tran

In this post, I will explain how to calculate a Bayesian estimator. The taken example is very simple: estimate the parameter θ of a Bernoulli distribution. A random variable X which has the Bernoulli distribution is defined as with In this case, we can write . In reality, the simplest way …

Bayesian estimator of the Bernoulli parameter Read More »

N-gram language models – Part 2

Data Science Blog / By mtiAdmin

Background In part 1 of my project, I built a unigram language model: it estimates the probability of each word in a text simply based on the fraction of times the word appears in that text. The text used to train the unigram model is the book “A Game of Thrones” by George R. R. Martin (called train). …

N-gram language models – Part 2 Read More »

Pages: 1 2

N-gram language models – Part 1

Data Science Blog / By mtiAdmin

Background Language modeling — that is, predicting the probability of a word in a sentence — is a fundamental task in natural language processing. It is used in many NLP applications such as autocomplete, spelling correction, or text generation. Currently, language models based on neural networks, especially transformers, are the state of the art: they predict very accurately a …

N-gram language models – Part 1 Read More »

Pages: 1 2

Statistics

Introduction to Feature Engineering

Introduction to Healthcare Data Science

Basic time – related machine learning models

k-Nearest Neighbors algorithms

Hypothesis Testing for One – Sample Mean

Binomial Theorem

Monte Carlo Simulation

Bayesian estimator of the Bernoulli parameter

N-gram language models – Part 2

N-gram language models – Part 1

Head Office (Japan)

5th Floor, Tokyo Opera City Tower 3-20-2, Nishi-Shinjuku, Shinjuku-ku, Tokyo 163-1435, Japan

+81-3-5333-6789

HCM Office (Headquarter in Vietnam)

15th Floor, Cong Hoa Garden, 20 Cong Hoa Str, Ward 12, Tan Binh District, Ho Chi Minh City, Vietnam

028 6654 0287

Da Nang Office (Branch in Vietnam)

23rd Floor, Vietinbank Building, 36 Tran Quoc Toan St, Hai Chau 1 Ward, Hai Chau District, Danang City, Vietnam

023 6652 6468

Privacy Policy

Copyright © 2024

Statistics

Cookie