Bugfix/48 no storer write #51
Conversation
- hdfs test enabled for build, while s3 ignored
- readme update
TODO: add a test to verify whether it works with s3-over-hadoopFs like this.
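A minimal sketch of what that s3-over-hadoopFs check might look like, assuming Spark with the Hadoop S3A connector on the classpath and Atum's listener already registered; the bucket and path below are hypothetical:

```scala
import org.apache.spark.sql.{SaveMode, SparkSession}

// Hypothetical check: write through Hadoop's S3A filesystem and expect
// the listener to emit an _INFO file next to the output, as on HDFS.
val spark = SparkSession.builder().appName("s3a-info-check").getOrCreate()
val df = spark.range(10).toDF("id")
val outputPath = "s3a://some-test-bucket/atum/output" // hypothetical bucket
df.write.mode(SaveMode.Overwrite).parquet(outputPath)
```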
```diff
 case _ =>
-  Atum.log.info("No usable storer is set, therefore no data will be written the automatically with DF-save to an _INFO file.")
+  Atum.log.debug(s"SparkQueryExecutionListener.onSuccess: writing to Hadoop FS")
+  writeInfoFileForQuery(qe)
```
Missing the info file write here was the main cause.
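For context, a simplified sketch of the shape this listener logic appears to take after the fix; the S3 branch and its helper name here are placeholders, not the actual API:

```scala
// Assumed shape: on a successful `save`, dispatch to a storer-specific
// writer, falling back to the Hadoop FS _INFO writer when no dedicated
// storer matches.
override def onSuccess(funcName: String, qe: QueryExecution, durationNs: Long): Unit = {
  if (funcName == "save") {
    storer match {
      case Some(s3Storer) =>
        writeInfoFileForS3Query(qe, s3Storer) // hypothetical S3-specific branch
      case _ =>
        Atum.log.debug(s"SparkQueryExecutionListener.onSuccess: writing to Hadoop FS")
        writeInfoFileForQuery(qe) // the write that was previously missing
    }
  }
}
```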
Code reviewed, built, and run in a stand-alone project.
```scala
df.write.mode(SaveMode.Overwrite)
  .parquet(outputPath)

{
```
I am not sure I get how this is styled. Shouldn't there be some keyword here?
What do you mean? The `{}` block is used to limit the visibility of `val outputPath`, as a logical constraint. Or are you asking about the formatting of the block?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, interesting, I hadn't thought about it that way. I have never seen a standalone block like this in Scala, so it seemed weird to me.
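For illustration, a minimal sketch of the idiom under discussion; the path below is hypothetical:

```scala
// A bare block is just an expression; vals declared inside it are not
// visible afterwards, so each step gets a tightly scoped outputPath.
{
  val outputPath = "/tmp/atum-example/stage1" // hypothetical path
  df.write.mode(SaveMode.Overwrite).parquet(outputPath)
}
// outputPath is out of scope here; a later block may define its own.
```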
Just small stuff, mostly code style.
Resolved review threads (outdated):
- atum/src/main/scala/za/co/absa/atum/core/SparkQueryExecutionListener.scala
- examples/src/test/scala/za/co/absa/atum/HdfsInfoIntegrationSuite.scala
```scala
import za.co.absa.atum.utils._

@Ignore
class SampleMeasurementsS3RunnerSpec extends AnyFunSuite
```
Why is this here?
Because unlike the hadoop-fs tests, these tests should not be run against actual S3. Thus, they now:
- serve as an example
- can be run manually, provided certain conditions are met (files exist on S3 inside a specified bucket, a key ID is supplied, a local SAML profile is supplied)
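As a sketch of the mechanism, assuming ScalaTest 3.x; the test body is a placeholder, not the actual suite contents:

```scala
import org.scalatest.Ignore
import org.scalatest.funsuite.AnyFunSuite

// A class-level @Ignore makes ScalaTest skip every test in the suite;
// removing the annotation re-enables it for a manual run.
@Ignore
class SampleMeasurementsS3RunnerSpec extends AnyFunSuite {
  test("writes an _INFO file to S3") {
    // placeholder: would need the bucket contents, key ID, and
    // local SAML profile described above
  }
}
```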
Resolved review thread (outdated):
- atum/src/main/scala/za/co/absa/atum/utils/ExecutionPlanUtils.scala
Force-pushed: 6a03ada → a051dfa
I guess I can approve the functionality as well.
Merge commit message: "# Conflicts: examples/pom.xml"
Bugfix of #48.
A small integration test has been added to verify the correct behavior for this case. It was also tested with Enceladus's aws-poc, and the _INFO file generation was observed to work without problems (without changes; an explicit output path is used there anyway).
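A hedged sketch of the kind of assertion such an integration test can make, assuming a local Hadoop-backed SparkSession with Atum's tracking already enabled (setup omitted); names and paths are illustrative, not the actual suite:

```scala
import org.apache.hadoop.fs.{FileSystem, Path}
import org.apache.spark.sql.{SaveMode, SparkSession}

// Illustrative check: after a DataFrame save, an _INFO file should sit
// next to the output on the (Hadoop) filesystem.
val spark = SparkSession.builder().master("local[*]").appName("info-check").getOrCreate()
val outputPath = "/tmp/atum-it/output" // hypothetical path
spark.range(10).toDF("id").write.mode(SaveMode.Overwrite).parquet(outputPath)

val fs = FileSystem.get(spark.sparkContext.hadoopConfiguration)
assert(fs.exists(new Path(s"$outputPath/_INFO")))
```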