Run queries using the BigQuery DataFrames bigframes.pandas APIs
Stay organized with collections
Save and categorize content based on your preferences.
Use the BigQuery DataFrames bigframes.pandas APIs to perform data analysis via the BigQuery Query engine.
Explore further
For detailed documentation that includes this code sample, see the following:
Code sample
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License
, and code samples are licensed under the Apache 2.0 License
. For details, see the Google Developers Site Policies
. Java is a registered trademark of Oracle and/or its affiliates.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],[],[[["\u003cp\u003eUtilize the \u003ccode\u003ebigframes.pandas\u003c/code\u003e APIs to analyze data through the BigQuery Query engine.\u003c/p\u003e\n"],["\u003cp\u003eLoad data from BigQuery using \u003ccode\u003ebpd.read_gbq\u003c/code\u003e with a specified query or table.\u003c/p\u003e\n"],["\u003cp\u003eCompute aggregate statistics, such as the mean, on data in BigQuery DataFrames.\u003c/p\u003e\n"],["\u003cp\u003eUse groupby operations to find the mean of a certain variable based on a certain criteria.\u003c/p\u003e\n"]]],[],null,["# Run queries using the BigQuery DataFrames bigframes.pandas APIs\n\nUse the BigQuery DataFrames bigframes.pandas APIs to perform data analysis via the BigQuery Query engine.\n\nExplore further\n---------------\n\n\nFor detailed documentation that includes this code sample, see the following:\n\n- [Use BigQuery DataFrames](/bigquery/docs/use-bigquery-dataframes)\n\nCode sample\n-----------\n\n### Python\n\n\nBefore trying this sample, follow the Python setup instructions in the\n[BigQuery quickstart using\nclient libraries](/bigquery/docs/quickstarts/quickstart-client-libraries).\n\n\nFor more information, see the\n[BigQuery Python API\nreference documentation](/python/docs/reference/bigquery/latest).\n\n\nTo authenticate to BigQuery, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for client libraries](/bigquery/docs/authentication#client-libs).\n\n import bigframes.pandas as bpd\n\n # Load data from BigQuery\n query_or_table = \"bigquery-public-data.ml_datasets.penguins\"\n bq_df = bpd.read_gbq(query_or_table)\n\n # Inspect one of the columns (or series) of the DataFrame:\n bq_df[\"body_mass_g\"]\n\n # Compute the mean of this series:\n average_body_mass = bq_df[\"body_mass_g\"].mean()\n print(f\"average_body_mass: {average_body_mass}\")\n\n # Find the heaviest species using the groupby operation to calculate the\n # mean body_mass_g:\n (\n bq_df[\"body_mass_g\"]\n .groupby(by=bq_df[\"species\"])\n .mean()\n .sort_values(ascending=False)\n .head(10)\n )\n\nWhat's next\n-----------\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=bigquery)."]]