I am currently trying to reproduce the results of the following article.
http://karpathy.github.io/2015/05/21/rnn-effectiveness/
I am using Keras with the Theano backend. In the article, he talks about controlling the temperature of the final softmax layer to give different outputs.
Temperature. We can also play with the temperature of the Softmax during sampling. Decreasing the temperature from 1 to some lower number (e.g. 0.5) makes the RNN more confident, but also more conservative in its samples. Conversely, higher temperatures will give more diversity but at cost of more mistakes (e.g. spelling mistakes, etc). In particular, setting temperature very near zero will give the most likely thing that Paul Graham might say:
My model is as follows.
from keras.models import Sequential
from keras.layers import LSTM, Dropout, Dense
from keras.optimizers import Adam

model = Sequential()
model.add(LSTM(128, batch_input_shape = (batch_size, 1, 256), stateful = True, return_sequences = True))
model.add(LSTM(128, stateful = True))
model.add(Dropout(0.1))
model.add(Dense(256, activation = 'softmax'))
model.compile(optimizer = Adam(),
              loss = 'categorical_crossentropy',
              metrics = ['accuracy'])
The only way I can think of to adjust the temperature of the final Dense layer would be to get its weight matrix and scale it by the temperature. Does anyone know of a better way to do it? Also, if anyone sees anything wrong with how I set up the model, please let me know, since I am new to RNNs.
Temperature modifies the output distribution of the mapping. For example:

low temperature softmax probs: [0.01, 0.01, 0.98]
high temperature softmax probs: [0.20, 0.20, 0.60]
In practice, we often see softmax with temperature, which is a slight modification of the standard softmax:

p_i = exp(x_i / τ) / Σ_{j=1}^{N} exp(x_j / τ)

The parameter τ is called the temperature, and it controls the softness of the probability distribution.
The output of a softmax is a vector (say v) with the probability of each possible outcome. The probabilities in vector v sum to one over all possible outcomes or classes.
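To make this concrete, here is a rough numpy sketch (with made-up logits, not taken from the model above) showing how the temperature τ flattens or sharpens the distribution:

import numpy as np

def softmax_with_temperature(logits, tau=1.0):
    # divide the logits by the temperature before exponentiating
    x = np.asarray(logits, dtype=np.float64) / tau
    x = x - x.max()          # subtract the max for numerical stability
    e = np.exp(x)
    return e / e.sum()

logits = [1.0, 1.0, 5.0]     # hypothetical logits
print(softmax_with_temperature(logits, tau=0.5))   # sharper, more confident
print(softmax_with_temperature(logits, tau=1.0))   # standard softmax
print(softmax_with_temperature(logits, tau=2.0))   # flatter, more diverse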
Well, it looks like the temperature is something you apply to the output of the softmax layer during sampling. I found this example.
https://github.com/fchollet/keras/blob/master/examples/lstm_text_generation.py
He applies the following function to sample from the softmax output.
import numpy as np

def sample(a, temperature=1.0):
    # helper function to sample an index from a probability array
    a = np.log(a) / temperature
    a = np.exp(a) / np.sum(np.exp(a))
    return np.argmax(np.random.multinomial(1, a, 1))
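For example, a sampling step with the stateful model from the question might look like this (x, batch_size, and current_char_index are hypothetical names, assuming one-hot encoded characters of size 256):

import numpy as np

# build a one-hot input for the current character (hypothetical setup)
x = np.zeros((batch_size, 1, 256))
x[0, 0, current_char_index] = 1.0

preds = model.predict(x, batch_size = batch_size)[0]   # softmax probabilities, shape (256,)
next_index = sample(preds, temperature = 0.5)          # lower temperature -> more conservative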
The answer from @chasep255 works OK, but you will get warnings because of log(0). You can simplify the operation, e^(log(a)/T) = a^(1/T), and get rid of the log:
import numpy as np

def sample(a, temperature=1.0):
    a = np.array(a) ** (1 / temperature)
    p_sum = a.sum()
    sample_temp = a / p_sum
    return np.argmax(np.random.multinomial(1, sample_temp, 1))
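As a quick sanity check (with a made-up probability vector), both forms give the same distribution, except that the first one warns on log(0):

import numpy as np

a = np.array([0.1, 0.2, 0.7])                  # made-up probability vector
T = 0.5
v1 = np.exp(np.log(a) / T); v1 /= v1.sum()     # original log/exp form
v2 = a ** (1 / T);          v2 /= v2.sum()     # simplified power form
print(np.allclose(v1, v2))                     # True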
Hope it helps!