Normalizzazione dati

A standard approach is to scale the inputs to have mean 0 and a variance of 1. Also linear decorrelation/whitening/pca helps a lot.

1- Min-max normalization retains the original distribution of scores except for a scaling factor and transforms all the scores into a common range [0, 1]. However, this method is not

robust (i.e., the method is highly sensitive to outliers.

2- Standardization (Z-score normalization) The most commonly used technique, which is calculated using the arithmetic mean and standard deviation of the given data. However, both mean and standard deviation are sensitive to outliers, and this technique does not guarantee a common numerical range for the normalized scores. Moreover, if the input scores are not Gaussian distributed, this technique does not retain the input distribution at the output.

3- Median and MAD: The median and median absolute deviation (MAD) are insensitive to outliers and the points in the extreme tails of the distribution. therefore it is robust. However, this technique does not retain the input distribution and does not transform the scores into a common numerical range.

4- tanh-estimators: The tanh-estimators introduced by Hampel et al. are robust and highly efficient. The normalization is given by

where μGH and σGH are the mean and standard deviation estimates, respectively, of the genuine score distribution as given by Hampel estimators.

Therefore I recommend tanh-estimators.

reference https://www.cs.ccu.edu.tw/~wylin/BA/Fusion_of_Biometrics_II.ppt

Evernote consente di ricordare tutto e di organizzarti senza sforzo. Scarica Evernote.

Comments

Word2vec - Sentiment Analysis

Word2vec è un modello (evoluzione Doc2vec) https://sourcedexter.com/tensorflow-text-classification/ https://github.com/tflearn/tflearn Vector space models (VSMs) represent (embed) words in a continuous vector space where semantically similar words are mapped to nearby points ('are embedded nearby each other') Word2vec is a particularly computationally-efficient predictive model for learning word embeddings from raw text. It comes in two flavors, the Continuous Bag-of-Words model (CBOW) and the Skip-Gram model (Section 3.1 and 3.2 in Mikolov et al. ) Skip Gram Model http://mccormickml.com/2016/04/19/word2vec-tutorial-the-skip-gram-model/ Bag-Of-Words Model https://ongspxm.github.io/blog/2014/12/bag-of-words-natural-language-processing/ Il modello è semplice perchè usa un trucco del auto-encoder l'output layer fa deve ritornare la stessa cosa dell'input (apprendimento supervisionato anche se non si hanno le label) poi si rimuove il livello di output e si ottengono i p...

Pastorella di Natale fatta con rotolo di carta igienica pallina da ping pony per il presepe dell'asilo.

Monitor in tempo reale - Prometheus ed altri oggetti.

Prometheus - Monitoring Tool - Centos7 Installazione di Prometheus Alertmonitor Grafana Node-exporter con docker come servizio e cartelle montate localmente per mantenere le configurazioni ed i dati dei servizi. La configurazione comprende anche l'uso di Telegram per le notifiche Azioni Prerequisiti (Docker Docker-Composer) Installazione immagini (Docker Pull) Configurazione avvio immagini Configurazione servizi Configurazione avvio servizi Test (configurazione ed architettura) Prerequisiti docker docker-compose Riferimenti installazione e configurazione di sistema https://www.digitalocean.com/community/tutorials/how-to-install-prometheus-using-docker-on-centos-7 http://www.tutorialspoint.com/articles/how-to-install-and-configure-prometheus-using-docker-on-centos-7 https://www.shellhacks.com/prometheus-monitoring-install-docker-ubuntu-centos/ regole di alert e queryper Prometheus https://awesome-prometheus-alerts.grep.to/rules.html https://www.fabernov...

paulo.difficiliora - un po' più difficile ?

Search This Blog