Organizations

@apache @conda-forge @DataOceanLab @wherobots @cloudnativegeo

jiayuasu/README.md

About me

Jia Yu is a co-founder of Wherobots Inc., a Spatial Intelligence Cloud platform for spatial data ETL, analytics, and AI. He was a Tenure-Track Assistant Professor of Computer Science at Washington State University from 2020 to 2023. He obtained his Ph.D. in Computer Science from Arizona State University. His research focuses on large-scale database systems and geospatial data management. In particular, he has worked on distributed geospatial data management systems, database indexing, and geospatial data visualization. Jia’s research has appeared in the most prestigious database and GIS conferences and journals, including SIGMOD, VLDB, ICDE, SIGSPATIAL, and the VLDB Journal. He is the main contributor to several open-source research projects such as Apache Sedona, a cluster computing framework for processing big spatial data, which receives 1 million downloads per month and has users and contributors from major companies.

My work on distributed geospatial database management systems (DBMSs)

I pioneered the field of distributed geospatial data management by creating the first full-fledged in-memory cluster computing framework for large-scale spatial analytics. My work became the industry standard, receiving over 2 million downloads per month and graduating as an Apache Software Foundation top-level project.

  • apache/sedona - A cluster computing framework for processing large-scale geospatial data
  • wherobots/havasu - The spatial table format for spatial lakehouse

My work on Apache Sedona community tools

My work on database indexing

I introduced a new class of lightweight and machine learning-enhanced database indexes that dramatically reduce storage overhead while maintaining query performance.

My work on geospatial data visualization

I developed scalable geospatial visualization techniques for big data, from offloading map rendering to distributed clusters to using smart sampling methods that reduce data-to-visualization time from minutes to sub-second.

  • jiayuasu/Tabula - Turbocharging Geospatial Visualization Dashboards via a Materialized Sampling Cube Approach

My previous life

My research lab and my students: https://jiayuasu.github.io/lab/

My publications: https://jiayuasu.github.io/publication/

Myself: https://jiayuasu.github.io/personal/

Pinned

  1. apache/sedona

    A cluster computing framework for processing large-scale geospatial data

    Java 2.3k 747

  2. apache/sedona-db

    A single-node analytical database engine with geospatial as a first-class citizen

    Rust 398 42

  3. apache/sedona-spatialbench

    A benchmark for assessing geospatial SQL analytics query performance across database systems

    Rust 35 16

  4. apache/iceberg

    Apache Iceberg

    Java 8.5k 3k

  5. microsoft/ALEX

    A library for building an in-memory, Adaptive Learned indEX

    C++ 698 130

  6. stx-btree

    Machine Learning based B+ Tree

    C++ 2