Big Data in R with Arrow

posit::conf(2024) 1-day workshop

Nic Crane + Steph Hazlitt

Welcome 👋

WiFi

  • Username: Posit Conf 2024
  • Password: conf2024


Workshop

Housekeeping


Gender Neutral Bathrooms

  • Located on levels 3, 4, 5, 6 & 7

Specialty Rooms

  • Meditation/Prayer Room (503)
  • Lactation Room (509)

*Available Mon & Tues 7am - 7pm, and Wed 7am - 5pm

Photos


Red Lanyards: NO photos


Please note everyone’s lanyard colors before taking a photo and respect their choices.

Code of Conduct


posit.co/code-of-conduct/

  • Contact any posit::conf staff member, identifiable by their staff t-shirt, or visit the conference general information desk.
  • Send a message to conf@posit.com; event organizers will respond promptly.
  • Call +1-844-448-1212; this phone number will be monitored for the duration of the event.

Meet Your Teaching Team


Co-Instructors

  • Nic Crane
  • Steph Hazlitt

Teaching Assistant

  • Jonathan Keane

Meet Each Other


  • When did you use R for the first time?
  • What is your favorite R package?
  • Which package hex sticker would you most like to find during posit::conf(2024)?

Getting Help Today


GREEN sticky note: I am OK / I am done

PINK sticky note: I need support / I am working


You can ask questions at any time during the workshop

Discord

  • pos.it/conf-event-portal (login)
  • Click on “Join Discord, the virtual networking platform!”
  • Browse Channels -> #workshop-arrow

We Assume

  • You know R
  • You are familiar with the dplyr package for data manipulation
  • You have data in your life that is too large to fit into memory or is sluggish in memory
  • You want to learn how to engineer your data storage for more performant access and analysis

Posit Workbench: Login 🛠️

  • Join Workbench via URL in the #workshop-arrow Discord channel
  • Select Posit Workbench >> Sign in with OpenID
  • Use your GitHub credentials to log in (click the icon)

Posit Workbench: Setup 🍽️

  • 🖱 +New Session
  • 🖱 Start Session (defaults are fine)
  • Run usethis::use_course("posit-conf-2024/arrow")

Posit Workbench: Setup 🍽️

  • Default location: Yes!
  • Unzip 📁: Yes!
  • Open Session dialog box: Resource Profile >> select Large
  • Open + run data/setup.R 🎉