Compute hash of OpenApi document #985

MaggieKimani1 · 2022-08-17T11:36:20Z

Fixes #799

…ash value

…Api document and its property values

baywet

I don't think we should be using the GetHashCode override for multiple reasons:

it's not its intended use, especially if caller want to store the hash somewhere for later comparison
it can yield equality between unequal objects
it will vary depending on chip instruction set (x86, 64, arm, arm64)
it requires also overriding the Equals method

I'd instead recommend using a proper hashing algorithm (configurable?) like sha256 or up.

see the documentation of get hash code.
https://docs.microsoft.com/en-us/dotnet/api/system.object.gethashcode?view=net-6.0#remarks

baywet · 2022-08-17T12:55:34Z

src/Microsoft.OpenApi/Models/OpenApiDocument.cs

+        {
+            // select two random prime numbers e.g 1 and 3 and use them to compute hash codes
+            int hash = 1;
+            hash = hash * 3 + (Workspace == null ? 0 : Workspace.GetHashCode());


I think you're supposed to select different prime numbers.

MaggieKimani1 · 2022-08-19T11:35:34Z

I don't think we should be using the GetHashCode override for multiple reasons:

it's not its intended use, especially if caller want to store the hash somewhere for later comparison

it can yield equality between unequal objects

it will vary depending on chip instruction set (x86, 64, arm, arm64)

it requires also overriding the Equals method

I'd instead recommend using a proper hashing algorithm (configurable?) like sha256 or up.

see the documentation of get hash code. https://docs.microsoft.com/en-us/dotnet/api/system.object.gethashcode?view=net-6.0#remarks

I leaned more towards overriding GetHashCode() and tried getting the hash value for each of the property values to avoid collision as I believed the goal was to perform a simple object lookup as opposed to encryption which comes with its own set of complications:

You have to convert the OpenApi document object into a byte array for hash computation
We have to mark all object model types, OpenApiAny and Expression types as either Serializable or NonSerialized for the hashing to take place..

baywet · 2022-08-19T11:44:31Z

le object lookup as opposed to encryption which comes with its own set of complications:

You have to convert the OpenApi document object into a byte array for hash computation

We have to mark all object model types, OpenApiAny and Expression types as Serializable for the hashing to take place..

Nitpicking: hashing and encryption are two completely different things :) I'm not suggesting to rely on any encryption algorithm, simply on standard hashing.

The nice additional thing with using industry standard hashing is that if we document the process correctly, other libs (from other languages) could interop with it and generate the exact same hash for the same document.

Serialize and convert: well that's the beauty of it, we already have the infrastructure to serialize the document to JSON/YAML, let's use that to produce a terse representation (so white-spacing doesn't impact the result), convert to binary, hash, and voilà! You don't even need to implement changes on the whole object model anymore, just on the document or root object.

What do you think? Also @darrelmiller, any objection with this approach?

src/Microsoft.OpenApi/Models/OpenApiDocument.cs

baywet

Thanks for making the changes, a few minor comments.
We should also make sure the serialization is terse when we hash

src/Microsoft.OpenApi/Models/OpenApiDocument.cs

src/Microsoft.OpenApi.Readers/OpenApiDiagnostic.cs

src/Microsoft.OpenApi.Readers/OpenApiStreamReader.cs

This reverts commit 9772ad0.

baywet

thanks for making the changes, here are a few final remarks

src/Microsoft.OpenApi.Readers/OpenApiStreamReader.cs

src/Microsoft.OpenApi/Models/OpenApiDocument.cs

MaggieKimani1 added 4 commits August 17, 2022 13:18

Add a hashCode field to the diagnostics object to keep track of the h…

0636e4a

…ash value

Override the base GetHashCode() to compute the hash value for an Open…

500f0c7

…Api document and its property values

Simplify using statement, add tests, test file and cleanup

7a58221

Refactor failing tests

f9a32fa

MaggieKimani1 requested review from baywet, darrelmiller, irvinesunday, millicentachieng, peombwa and zengin as code owners August 17, 2022 11:36

baywet requested changes Aug 17, 2022

View reviewed changes

Merge branch 'vnext' into mk/enhancement-create-hash-code

9c0fa21

MaggieKimani1 added 5 commits August 22, 2022 14:01

Compute hash value using hashing algorithm during serialization

c7e3ed8

Clean up test

8d20075

Code cleanup

a676e37

Code cleanup

96f060b

Update public Api interface

9772ad0

github-advanced-security bot found potential problems Aug 22, 2022

View reviewed changes

src/Microsoft.OpenApi/Models/OpenApiDocument.cs Fixed Show fixed Hide fixed

baywet requested changes Aug 22, 2022

View reviewed changes

MaggieKimani1 added 5 commits August 22, 2022 16:38

Revert "Update public Api interface"

d50f857

This reverts commit 9772ad0.

Revert previous changes

8674fe3

Refactor hashing logic and add test cases

ba73777

Update public API interface

2bb383c

Remove framework display name from interface

feebffc

baywet requested changes Aug 26, 2022

View reviewed changes

Address PR feedback

2539576

baywet approved these changes Aug 26, 2022

View reviewed changes

MaggieKimani1 merged commit fb6ea09 into vnext Aug 26, 2022

MaggieKimani1 deleted the mk/enhancement-create-hash-code branch August 26, 2022 14:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Compute hash of OpenApi document #985

Compute hash of OpenApi document #985

Uh oh!

MaggieKimani1 commented Aug 17, 2022

Uh oh!

baywet left a comment

Uh oh!

baywet Aug 17, 2022

Uh oh!

MaggieKimani1 commented Aug 19, 2022 •

edited

Loading

Uh oh!

baywet commented Aug 19, 2022

Uh oh!

Uh oh!

baywet left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

baywet left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Compute hash of OpenApi document #985

Compute hash of OpenApi document #985

Uh oh!

Conversation

MaggieKimani1 commented Aug 17, 2022

Uh oh!

baywet left a comment

Choose a reason for hiding this comment

Uh oh!

baywet Aug 17, 2022

Choose a reason for hiding this comment

Uh oh!

MaggieKimani1 commented Aug 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

baywet commented Aug 19, 2022

Uh oh!

Uh oh!

baywet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

baywet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

MaggieKimani1 commented Aug 19, 2022 •

edited

Loading