
Conversation

@AakashKumarNain
Member

Added Cohen's Kappa as a new metric.

@AakashKumarNain AakashKumarNain requested a review from a team as a code owner June 1, 2019 10:26
Member

@WindQAQ WindQAQ left a comment


Hi @AakashKumarNain, thanks for the contribution! I wonder if I misunderstand something here:

Shouldn't update_state aggregate the previous statistics, like the other metrics in tf.keras.metrics do? For example:

import numpy as np
import tensorflow as tf

y_true = np.array([0, 1, 0, 1, 0])
y_pred = np.array([0, 1, 0, 1, 1])
m = tf.keras.metrics.BinaryAccuracy()
m.update_state(y_true, y_pred)  # 1st batch accuracy: 0.8
print(m.result().numpy())  # 0.8

y_true = np.array([0, 1, 0, 1, 0])
y_pred = np.array([0, 1, 0, 1, 0])
m.update_state(y_true, y_pred)  # 2nd batch accuracy: 1.0
print(m.result().numpy())  # 0.9, accumulated over both batches: (0.8 + 1) / 2

m.reset_states()  # reset the accumulated state if needed

Seems that the implementation now only takes current input into account. Is there any reason that we have to only compute the current input's Cohen's Kappa instead of accumulating states and computing overall Cohen's Kappa? Please correct me if I misunderstand. cc @facaiy.

EDIT: #265 (comment)

@WindQAQ WindQAQ mentioned this pull request Jun 3, 2019
@AakashKumarNain
Member Author

Thanks @WindQAQ. I am aware that the state needs to be accumulated, but I couldn't figure out how to do that. @facaiy, can you elaborate a bit on how to achieve this?

@facaiy
Member

facaiy commented Jun 6, 2019

@AakashKumarNain I think tf.keras.metrics.AUC is a good example. We should save/update intermediate variables (say, the confusion matrix) rather than the result (the kappa score).
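
Roughly along these lines (just a sketch of the pattern; the class and variable names are illustrative, not taken from the PR):

import tensorflow as tf

class CohenKappaSketch(tf.keras.metrics.Metric):
    """Illustrative only: accumulate a confusion matrix, derive kappa in result()."""

    def __init__(self, num_classes, name="cohen_kappa", dtype=tf.float32):
        super(CohenKappaSketch, self).__init__(name=name, dtype=dtype)
        self.num_classes = num_classes
        # The intermediate state is the running confusion matrix,
        # not the kappa score itself.
        self.conf_mtx = self.add_weight(
            name="conf_mtx",
            shape=(num_classes, num_classes),
            initializer="zeros",
            dtype=tf.int64)

    def update_state(self, y_true, y_pred, sample_weight=None):
        new_conf_mtx = tf.math.confusion_matrix(
            tf.reshape(y_true, [-1]),
            tf.reshape(y_pred, [-1]),
            num_classes=self.num_classes,
            dtype=tf.int64)
        # Merge the current batch into the accumulated statistics.
        return self.conf_mtx.assign_add(new_conf_mtx)

    def result(self):
        cm = tf.cast(self.conf_mtx, tf.float64)
        total = tf.reduce_sum(cm)
        observed = tf.reduce_sum(tf.linalg.diag_part(cm)) / total
        expected = tf.reduce_sum(
            tf.reduce_sum(cm, axis=0) * tf.reduce_sum(cm, axis=1)) / (total * total)
        return tf.cast((observed - expected) / (1.0 - expected), self.dtype)

    def reset_states(self):
        self.conf_mtx.assign(tf.zeros_like(self.conf_mtx))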

@facaiy
Member

facaiy commented Jun 6, 2019

@AakashKumarNain
Member Author

AakashKumarNain commented Jun 6, 2019

Thank you @facaiy for the information.

@WindQAQ @facaiy I think I have found a very elegant solution to the problem. I cannot think of anything better than this. Please take a look at this notebook. I will make the changes in the PR once you are okay with the solution, though the graph-mode errors show up again in the test cases.

https://colab.research.google.com/drive/10CNyrnq10RUTHssUcfSdtIyYdGnC5bGD

@facaiy
Member

facaiy commented Jun 9, 2019

@AakashKumarNain Aakash, the new solution looks good :-)

@facaiy facaiy added the metrics label Jun 9, 2019
@AakashKumarNain
Member Author

@facaiy Thanks Yan. Can you please help me with the errors in the test cases? I have shown them in the same notebook.

@facaiy
Member

facaiy commented Jun 10, 2019

Could you remove the self.kappa_score variable, Aakash? It seems that we don't need it anymore.

@AakashKumarNain
Member Author

It didn't help. Now the error is raised due to the conf_mtx variable that depends on the self.conf_mtx variable defined in the constructor.

@facaiy
Member

facaiy commented Jun 10, 2019

I'm not so sure. It seems that we have to initialize the variables ourselves or use the metric in a Keras model; please refer to the metrics' test cases :-)

https://github.com/tensorflow/tensorflow/blob/r2.0/tensorflow/python/keras/metrics_test.py

@AakashKumarNain
Member Author

I checked that and I am doing the same thing. This line self.conf_mtx.assign_add(new_conf_mtx) is causing the problem. The AssignAdd operation is throwing errors.

@WindQAQ
Member

WindQAQ commented Jun 10, 2019

Hi, @AakashKumarNain, you have to initialize variables within the metric object:

self.evaluate(tf.compat.v1.variables_initializer(kp_obj1.variables))

Here is the revised notebook.
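
For reference, a minimal sketch of how that initialization fits inside a tf.test.TestCase; kp_obj1 is assumed to be a CohenKappa instance, and the assertion is deliberately loose (it only checks that a second update changes the accumulated result), so this is not the PR's actual test:

import numpy as np
import tensorflow as tf
import tensorflow_addons as tfa


class CohenKappaTest(tf.test.TestCase):

    def test_update_state_accumulates(self):
        kp_obj1 = tfa.metrics.CohenKappa(num_classes=2)
        # The metric's variables (e.g. the confusion matrix) are not
        # initialized automatically in graph mode, so do it explicitly.
        self.evaluate(tf.compat.v1.variables_initializer(kp_obj1.variables))

        y_true = np.array([0, 1, 0, 1, 0])
        y_pred = np.array([0, 1, 0, 1, 1])
        self.evaluate(kp_obj1.update_state(y_true, y_pred))
        first = self.evaluate(kp_obj1.result())

        # A second, perfectly matching batch should be merged into the
        # accumulated confusion matrix, so the result should change.
        self.evaluate(kp_obj1.update_state(y_true, y_true))
        second = self.evaluate(kp_obj1.result())
        self.assertNotAlmostEqual(first, second)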

@AakashKumarNain
Member Author

Thank you @WindQAQ for looking into it. Can you give me access to the notebook?

@WindQAQ
Member

WindQAQ commented Jun 10, 2019

Sorry about that... Already shared it. Please click the link above again.

@AakashKumarNain
Member Author

Got it. Thanks.

@Squadrick
Member

@AakashKumarNain I'm trying to figure that out. Could you tell me your TF 2.x version, from tf.__version__?

@AakashKumarNain
Member Author

AakashKumarNain commented Jun 12, 2019

@facaiy @WindQAQ @Squadrick I am on 2.0.0-beta0. The funny thing is that it is now failing on tf.initializers, which shouldn't be the case anyway.

Another funny thing is that everything works with Py3 and all test cases pass, but something is always off with Py2. Py2 support is going to end in December; maybe that is the sign 😆

Either the TF version under Py2 is very different from the one under Py3, or something is seriously broken.

@seanpmorgan
Member

> @facaiy @WindQAQ @Squadrick I am on 2.0.0-beta0. The funny thing is that it is now failing on tf.initializers, which shouldn't be the case anyway.
>
> Another funny thing is that everything works with Py3 and all test cases pass, but something is always off with Py2. Py2 support is going to end in December; maybe that is the sign
>
> Either the TF version under Py2 is very different from the one under Py3, or something is seriously broken.

Sorry, I haven't been tracking this too closely. If you're referring to why the py3 tests are passing in our CI, it's probably because there is no py34 tf2-nightly available, so it's running on an old version. See #279

@AakashKumarNain
Member Author

But @seanpmorgan, no matter the version, tf.initializers should be there. It is like tf.nn, an important part of the API, right?

@seanpmorgan
Member

seanpmorgan commented Jun 12, 2019

See #278 and #273. I asked but didn't look into when this API enforcement occurred. (So tf.keras.initializers should work.) The py34 test is passing on an outdated nightly, I believe.

@seanpmorgan
Member

I checked on py36 and there is no alias for tf.initializers anymore.

@Squadrick
Member

Can confirm: tf.initializers is not found in the latest TF 2.x nightly for Python 3.6, but tf.keras.initializers works.
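
For anyone hitting the same thing, the fix is essentially a one-line substitution; the conf_mtx variable below is just a hypothetical example of where such an initializer might be used, not the PR's exact code:

import tensorflow as tf

# The tf.initializers alias is gone from recent TF 2.x nightlies:
#     initializer = tf.initializers.zeros()   # AttributeError
# Go through tf.keras.initializers instead:
initializer = tf.keras.initializers.Zeros()

# Hypothetical usage, e.g. a non-trainable confusion-matrix variable.
conf_mtx = tf.Variable(
    initial_value=initializer(shape=(2, 2), dtype=tf.int64),
    trainable=False,
    name="conf_mtx")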

@AakashKumarNain
Member Author

@Squadrick I fixed it. Can you trigger the test again, please?

@AakashKumarNain
Member Author

Thanks @Squadrick. Finally! This has taken an enormous amount of time. Thanks @facaiy, @WindQAQ, and @seanpmorgan for all the guidance and effort.

PS: Is there a list where we are keeping track of all the removed APIs?

@Squadrick Squadrick merged commit 977d96e into tensorflow:master Jun 12, 2019
@Squadrick
Member

@AakashKumarNain Thanks again for the contribution.

@seanpmorgan
Member

> Thanks @Squadrick. Finally! This has taken an enormous amount of time. Thanks @facaiy, @WindQAQ, and @seanpmorgan for all the guidance and effort.
>
> PS: Is there a list where we are keeping track of all the removed APIs?

Not that I know of, but it may be worth asking. The best place I know of to look is https://github.com/tensorflow/tensorflow/blob/master/tensorflow/tools/compatibility/all_renames_v2.py

But that mostly uses compat.v1 as the replacement. And thank you very much for putting in the time to get it all working! It sets a great example for stateful metrics in Addons!

@AakashKumarNain AakashKumarNain deleted the add_kappa branch June 13, 2019 10:09
@llu025

llu025 commented Nov 5, 2019

Hello,

I am using tfa.metrics.CohenKappa and something seems to be really wrong. While tf.keras.metrics.Accuracy is working fine, CohenKappa produces values above 1 (sometimes above 2).

I have a dictionary of metrics:

self.change_map_metrics = {
    "ACC": tf.keras.metrics.Accuracy(),
    "cohens kappa": tfa.metrics.CohenKappa(num_classes=2),
}

and then after each training epoch I call this:

def _compute_metrics(self, y_true, y_pred, metrics):
    """
    Compute the metrics specified in metrics.
    Write results to self.tb_writer
    Input:
        y_true - tensor (n, )
        y_pred - tensor (n, )
        metrics - dict {name: tf.metrics class instance}
    Output:
        None
    """
    y_true, y_pred = tf.reshape(y_true, [-1]), tf.reshape(y_pred, [-1])
    for name, metric in metrics.items():
        metric.update_state(y_true, y_pred)
        with self.tb_writer.as_default():
            tf.summary.scalar(name, metric.result())
        metric.reset_states()

As you can see on TensorBoard, it does not really make any sense.

[TensorBoard screenshot: the logged Cohen's kappa curve goes above 1]

I am using TF 2.0 and the docker image tensorflow/tensorflow:2.0.0-gpu.
Please let me know if you need further details.

@WindQAQ
Member

WindQAQ commented Nov 5, 2019

@llu025 Hi, is it possible to dump y_true and y_pred in your program? Thank you.

@llu025

llu025 commented Nov 5, 2019

> @llu025 Hi, is it possible to dump y_true and y_pred in your program? Thank you.

I am not really sure what you mean by "dump". If you are interested in what they contain, they are boolean, so either 0s or 1s.

@WindQAQ
Member

WindQAQ commented Nov 5, 2019

> > @llu025 Hi, is it possible to dump y_true and y_pred in your program? Thank you.
>
> I am not really sure what you mean by "dump". If you are interested in what they contain, they are boolean, so either 0s or 1s.

I mean their exact values, the ones that could make the kappa score > 1.

@AakashKumarNain
Member Author

Yeah, it would really help with debugging if you could provide the values that produced this score.

@llu025

llu025 commented Nov 5, 2019

I printed 20 consecutive values from a random index for both y_pred and y_true; this is an example from one epoch:

[screenshot: printed boolean y_true and y_pred values]

I thought the problem might have been the use of boolean labels instead of integers, so I added this in my _compute_metrics function:

if y_pred.dtype == tf.bool:
    y_true, y_pred = tf.cast(y_true, tf.uint8), tf.cast(y_pred, tf.uint8)

so I dumped them again:

[screenshot: printed y_true and y_pred values after the cast to uint8]

but I still get kappa above 1:

[TensorBoard screenshot: the kappa curve still exceeds 1]

@WindQAQ
Member

WindQAQ commented Nov 5, 2019

Thank you!
@AakashKumarNain I found an abnormal test case that makes it go above 1:
https://colab.research.google.com/drive/1lA0mc0prr0XB4buKOdf44WvKi_xMXtIt
I guess it's due to int32 overflow.

@AakashKumarNain
Member Author

@WindQAQ Thanks for the test cases. Yeah, it seems that we need to change the dtype to int64.
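
To make the overflow concrete, a rough illustration (not the metric's exact code): the expected-agreement term of Cohen's kappa multiplies row and column totals of the accumulated confusion matrix, and with int32 those products wrap around once the counts are large enough, which is how the score can leave the [-1, 1] range.

import tensorflow as tf

# Marginal totals of a large confusion matrix, e.g. pixel counts over an epoch.
row_totals = tf.constant([3_000_000, 1_000_000], dtype=tf.int32)
col_totals = tf.constant([2_900_000, 1_100_000], dtype=tf.int32)

# int32 products silently wrap around, corrupting the expected-agreement term.
print(tf.reduce_sum(row_totals * col_totals).numpy())

# Casting to int64 keeps the arithmetic exact.
print(tf.reduce_sum(
    tf.cast(row_totals, tf.int64) * tf.cast(col_totals, tf.int64)).numpy())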

@WindQAQ
Member

WindQAQ commented Nov 5, 2019

This should be fixed in #675 :D

@AakashKumarNain
Member Author

AakashKumarNain commented Nov 5, 2019

> This should be fixed in #675 :D

Thank you @WindQAQ. I checked the same on my end; it seems that the dtype was the issue.
https://colab.research.google.com/drive/1vPGX11biU3CepL3wHp0tZocihu-d9fgK
