Skip to content
This repository was archived by the owner on Dec 22, 2019. It is now read-only.

Conversation

lowerthansound
Copy link
Owner

@lowerthansound lowerthansound commented Sep 28, 2018

  • Lint code
  • Review
  • Summarize changes

@lowerthansound lowerthansound force-pushed the duplicate branch 6 times, most recently from ec4363b to 82ab6f3 Compare September 29, 2018 22:13
araraonline added 2 commits September 29, 2018 20:40
TODO: This shall be reversed later, or be made a bit more strict. My
best choice is: ignore when it is empty of dtype object, consider if it
is empty of another dtype.

May interact somewhat with the result float64 of reindex.
Will be better made in a future version.
When there were duplicates on the columns index, sort was allowed and
duplicates were allowed if the indexes had the same values (as found by
idx.tolist()).

Now, considering that pandas doesn't allow to sort the index when there
are duplicate values (DataFrame.reindex fails) and that searching for
the same values is counter-productive and prone to fail, depending on
the different types of indexes, the behavior was modified to this:

- When sort=True and there are duplicates in at least one index, an
  error is raised and append stops.
- Dframes with duplicate indexes are only considered to be joined when
  the indexes share the same identity (that is, they are the same object
  comparable with `idx1 is  idx2`)

Some other improvements to the code have also been made and I believe it
is better in a general mode.
@lowerthansound lowerthansound merged commit 5dfe32e into secondtry Sep 29, 2018
@lowerthansound lowerthansound deleted the duplicate branch September 29, 2018 23:44
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant