What is the best way to get (stream) data from BigQuery to R (Rstudio server in Docker) -
i have number of large tables in google bigquery, containing data processed in r. running rstudio via docker on google cloud platform using container engine.
i have tested few routes table of 38 million rows (three columns) table size of 862 mb in bigquery.
the first route tested using r package bigrquery. option preferred data can directly queried bigquery. , data-acquisition can incorporated in r loops. option unfortunately slow, takes close hour complete.
the second option tried exporting bigquery table csv file on google cloud storage (approx 1 minute), , using public link import in rstudio (another 5 minutes). route entails quite manual handling, @ least not desirable.
in google cloud console noticed vm instances can granted access bigquery. also, rstudio can configured have root access in docker container.
so question: there way use backdoor enable fast data-transfer bigquery r dataframe in automated way? or there other ways achieve goal?
any highly appreciated!
edit:
i have loaded same table mysql database hosted in google cloud sql, time took 20 seconds load same amount of data. kind of translation bigquery sql option too.
Comments
Post a Comment