feat: added system test and sample for dataframe contains array by HemangChothani · Pull Request #365 · googleapis/python-bigquery

HemangChothani · 2020-11-04T12:28:29Z

Fixes #19

snippet-bot · 2020-11-04T12:28:34Z

Here is the summary of changes.

You added 1 region tag.

bigquery_load_table_dataframe_array_contains in samples/load_table_dataframe_array_contains.py

tswast · 2020-11-04T17:44:41Z

setup.py

-        # pyarrow 1.0.0 is required for the use of timestamp_as_object keyword.
-        "pyarrow >= 1.0.0, < 2.0dev",
+        # pyarrow 2.0.0 is required for the use of arrays in dataframe to load the table .
+        "pyarrow >= 2.0.0, < 3.0dev",


Let's not bump the minimum version here. Most features do work with 1.0, and pyarrow is a core library that is very useful to have a wide range of support.

tswast · 2020-11-04T17:46:51Z

tests/system.py

+                        None,
+                        (
+                            bigquery.SchemaField(
+                                "item", "INTEGER", "NULLABLE", None, (), None


Hmm... This is a bit of a surprising schema. It appears to match the behavior we were encountering previously. This feature is not supported if we cannot upload directly to a REPEATED INTEGER column.

tswast · 2020-11-04T17:49:05Z

samples/load_table_dataframe_array_contains.py

+    # table_id = "your-project.your_dataset.your_table_name"
+
+    dataframe = pandas.DataFrame({"A": [[1, 2, 3], [4, 5, 6], [7, 8, 9]]})
+    job = client.load_table_from_dataframe(dataframe, table_id)  # Make an API request.


Without an explicit schema, this sample is no different from the generic load_table_from_dataframe sample.

I was imagining system test XOR sample, as they are testing the same behavior.

tswast · 2020-11-04T22:25:12Z

I've sent #368 to capture just the desired setup.py changes.

It's possible there are some kinds of arrays (such as arrays of records) that are supported, but it appears arrays of scalars still aren't handled correctly.

feat: added system test and sample for dataframe contains array

683bbc9

HemangChothani requested review from a team and tswast November 4, 2020 12:28

HemangChothani requested a review from a team as a code owner November 4, 2020 12:28

HemangChothani requested a review from dinagraves November 4, 2020 12:28

google-cla bot added the cla: yes This human has signed the Contributor License Agreement. label Nov 4, 2020

product-auto-label bot added the api: bigquery Issues related to the googleapis/python-bigquery API. label Nov 4, 2020

tswast suggested changes Nov 4, 2020

View reviewed changes

tswast reviewed Nov 4, 2020

View reviewed changes

tswast mentioned this pull request Nov 4, 2020

BigQuery: Upload pandas DataFrame containing arrays #19

Closed

tswast closed this Nov 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: added system test and sample for dataframe contains array#365

feat: added system test and sample for dataframe contains array#365
HemangChothani wants to merge 1 commit intogoogleapis:masterfrom
MaxxleLLC:bigquery_issue_19

HemangChothani commented Nov 4, 2020

Uh oh!

snippet-bot bot commented Nov 4, 2020

Uh oh!

tswast Nov 4, 2020

Uh oh!

tswast Nov 4, 2020

Uh oh!

tswast Nov 4, 2020

Uh oh!

tswast commented Nov 4, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

HemangChothani commented Nov 4, 2020

Uh oh!

snippet-bot bot commented Nov 4, 2020

Uh oh!

tswast Nov 4, 2020

Choose a reason for hiding this comment

Uh oh!

tswast Nov 4, 2020

Choose a reason for hiding this comment

Uh oh!

tswast Nov 4, 2020

Choose a reason for hiding this comment

Uh oh!

tswast commented Nov 4, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants