I have a multi-output (200) binary classification model that I wrote in Keras.
In this model I want to add additional metrics such as ROC and AUC, but to my knowledge Keras doesn't have built-in ROC and AUC metric functions.
I tried to import the ROC and AUC functions from scikit-learn:
from sklearn.metrics import roc_curve, auc
from keras.models import Sequential
from keras.layers import Dense
.
.
.
model.add(Dense(200, activation='relu'))
model.add(Dense(300, activation='relu'))
model.add(Dense(400, activation='relu'))
model.add(Dense(300, activation='relu'))
model.add(Dense(200, init='normal', activation='softmax'))  # output layer
model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy', 'roc_curve', 'auc'])
but it's giving this error:
Exception: Invalid metric: roc_curve
How should I add ROC and AUC to Keras?
ROC AUC is the area under the ROC curve and is often used to evaluate how well an algorithm ranks objects of two classes relative to each other. By construction, this value lies in the interval [0, 1].
An ROC curve (receiver operating characteristic curve) is a graph showing the performance of a classification model at all classification thresholds. The curve plots two parameters against each other: the true positive rate (TPR) and the false positive rate (FPR).
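As a concrete illustration, scikit-learn's roc_curve traces TPR and FPR over every threshold; the toy labels and scores below are my own made-up values, not anything from the question:

import numpy as np
from sklearn.metrics import roc_curve, auc

# made-up ground truth and predicted scores
y_true = np.array([0, 0, 1, 1])
y_score = np.array([0.1, 0.4, 0.35, 0.8])

# fpr and tpr at every threshold found in y_score
fpr, tpr, thresholds = roc_curve(y_true, y_score)
roc_auc = auc(fpr, tpr)  # area under that curve: 0.75 here
print(fpr, tpr, roc_auc)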
The AUC (area under the curve) of the ROC (receiver operating characteristic; the default) or PR (precision-recall) curve is a quality measure for binary classifiers. Unlike accuracy, and like cross-entropy loss, ROC-AUC and PR-AUC evaluate all the operating points of a model.
How do ROC AUC plots work for multiclass models? For multiclass problems, ROC curves can be plotted with a one-versus-rest methodology: pit each class against all the others and you get the same number of curves as classes. The AUC score can also be calculated for each class individually, as sketched below.
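For instance, scikit-learn can do this one-versus-rest computation directly. The labels and scores here are illustrative toy values, and the multi_class='ovr' argument requires scikit-learn >= 0.22:

import numpy as np
from sklearn.metrics import roc_auc_score
from sklearn.preprocessing import label_binarize

# toy 3-class labels and per-class probability scores (rows sum to 1)
y_true = np.array([0, 1, 2, 2, 1, 0])
y_score = np.array([[0.7, 0.2, 0.1],
                    [0.2, 0.6, 0.2],
                    [0.1, 0.3, 0.6],
                    [0.2, 0.2, 0.6],
                    [0.3, 0.5, 0.2],
                    [0.8, 0.1, 0.1]])

# one-vs-rest: one binary AUC per class
y_bin = label_binarize(y_true, classes=[0, 1, 2])
per_class = [roc_auc_score(y_bin[:, i], y_score[:, i]) for i in range(3)]

# macro-averaged one-vs-rest AUC in a single call
macro = roc_auc_score(y_true, y_score, multi_class='ovr', average='macro')
print(per_class, macro)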
Because you can't calculate ROC AUC on mini-batches, you can only calculate it at the end of an epoch. There is a solution from jamartinh; I paste the code below for convenience:
from sklearn.metrics import roc_auc_score
from keras.callbacks import Callback

class RocCallback(Callback):
    def __init__(self, training_data, validation_data):
        self.x = training_data[0]
        self.y = training_data[1]
        self.x_val = validation_data[0]
        self.y_val = validation_data[1]

    def on_train_begin(self, logs={}):
        return

    def on_train_end(self, logs={}):
        return

    def on_epoch_begin(self, epoch, logs={}):
        return

    def on_epoch_end(self, epoch, logs={}):
        y_pred_train = self.model.predict_proba(self.x)
        roc_train = roc_auc_score(self.y, y_pred_train)
        y_pred_val = self.model.predict_proba(self.x_val)
        roc_val = roc_auc_score(self.y_val, y_pred_val)
        print('\rroc-auc_train: %s - roc-auc_val: %s' % (str(round(roc_train, 4)), str(round(roc_val, 4))), end=100 * ' ' + '\n')
        return

    def on_batch_begin(self, batch, logs={}):
        return

    def on_batch_end(self, batch, logs={}):
        return

roc = RocCallback(training_data=(X_train, y_train),
                  validation_data=(X_test, y_test))

model.fit(X_train, y_train,
          validation_data=(X_test, y_test),
          callbacks=[roc])
A more hackable way, using tf.contrib.metrics.streaming_auc:
import numpy as np
import tensorflow as tf
from sklearn.metrics import roc_auc_score
from sklearn.datasets import make_classification
from keras.models import Sequential
from keras.layers import Dense
from keras.utils import np_utils
from keras.callbacks import Callback, EarlyStopping

# define roc_callback, inspired by https://github.com/keras-team/keras/issues/6050#issuecomment-329996505
def auc_roc(y_true, y_pred):
    # any tensorflow metric
    value, update_op = tf.contrib.metrics.streaming_auc(y_pred, y_true)

    # find all variables created for this metric
    metric_vars = [i for i in tf.local_variables() if 'auc_roc' in i.name.split('/')[1]]

    # Add metric variables to GLOBAL_VARIABLES collection.
    # They will be initialized for new session.
    for v in metric_vars:
        tf.add_to_collection(tf.GraphKeys.GLOBAL_VARIABLES, v)

    # force to update metric values
    with tf.control_dependencies([update_op]):
        value = tf.identity(value)
        return value

# generate a small dataset
N_all = 10000
N_tr = int(0.7 * N_all)
N_te = N_all - N_tr
X, y = make_classification(n_samples=N_all, n_features=20, n_classes=2)
y = np_utils.to_categorical(y, num_classes=2)

X_train, X_valid = X[:N_tr, :], X[N_tr:, :]
y_train, y_valid = y[:N_tr, :], y[N_tr:, :]

# model & train
model = Sequential()
model.add(Dense(2, activation="softmax", input_shape=(X.shape[1],)))

model.compile(loss='categorical_crossentropy',
              optimizer='adam',
              metrics=['accuracy', auc_roc])

my_callbacks = [EarlyStopping(monitor='auc_roc', patience=300, verbose=1, mode='max')]

model.fit(X, y,
          validation_split=0.3,
          shuffle=True,
          batch_size=32, epochs=5, verbose=1,
          callbacks=my_callbacks)

# # or use an independent validation set
# model.fit(X_train, y_train,
#           validation_data=(X_valid, y_valid),
#           batch_size=32, epochs=5, verbose=1,
#           callbacks=my_callbacks)
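Note that tf.contrib was removed in TensorFlow 2.x, so the snippet above only runs on TF 1.x. On TF2, a streaming AUC already ships as tf.keras.metrics.AUC, and the whole workaround collapses to one argument in compile(); a minimal sketch, assuming tf.keras rather than standalone Keras:

import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Dense(2, activation='softmax', input_shape=(20,)),
])

# tf.keras.metrics.AUC accumulates confusion-matrix counts across
# batches, so the reported value is a streaming estimate of ROC AUC
model.compile(loss='categorical_crossentropy',
              optimizer='adam',
              metrics=['accuracy', tf.keras.metrics.AUC(name='auc')])

EarlyStopping(monitor='val_auc', mode='max') then works without any of the variable-collection bookkeeping above.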
Like you, I prefer using scikit-learn's built-in methods to evaluate AUROC. I find that the best and easiest way to do this in Keras is to create a custom metric. If TensorFlow is your backend, this can be implemented in very few lines of code:
import tensorflow as tf
from sklearn.metrics import roc_auc_score

def auroc(y_true, y_pred):
    return tf.py_func(roc_auc_score, (y_true, y_pred), tf.double)

# Build Model...

model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy', auroc])
Creating a custom callback, as mentioned in other answers, will not work in your case since your model has multiple outputs, but this will. Additionally, this method allows the metric to be evaluated on both training and validation data, whereas a Keras callback does not have access to the training data and can thus only be used to evaluate performance on the validation data.
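One caveat worth hedging: this metric runs once per batch and Keras averages the per-batch values, and roc_auc_score raises a ValueError whenever a batch happens to contain only one class. A defensive variant (auroc_safe and its 0.5 fallback are my own additions, not part of the original answer):

import numpy as np
import tensorflow as tf
from sklearn.metrics import roc_auc_score

def auroc_safe(y_true, y_pred):
    def _score(y_t, y_p):
        try:
            return np.float64(roc_auc_score(y_t, y_p))
        except ValueError:
            # a batch with a single class has no defined ROC AUC;
            # fall back to chance level so training does not crash
            return np.float64(0.5)
    return tf.py_func(_score, (y_true, y_pred), tf.double)

# model.compile(loss='categorical_crossentropy', optimizer='adam',
#               metrics=['accuracy', auroc_safe])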