Sample Applications & Development Tools: TiDB Future App Hackathon 2023

Using your time efficiently at a hackathon yields the best results, because it lets you focus on your hack. We created this space to highlight some of the tools we use and some that the community has submitted. We want this list to be a starting point and a reference for new and seasoned hackers alike.

We have prepared the following types of tools for your reference:

  • Sample Applications built with TiDB Cloud Serverless
  • Development Tool
    • Hybrid Transactional/Analytical Processing Tool
    • API & Integration Tool
    • AI Tool

Sample Applications built with TiDB Cloud Serverless

OSSInsight based on Serverless TiDB & ChatGPT3.5

Desc: OSS Insight is an AI-powered tool that analyzes billions of rows of data from GitHub events. The tool has a powerful Data Explorer that can generate SQL queries and present the results visually. With in-depth analysis of individual developers and GitHub repositories, OSS Insight uses big data to uncover insights and trends in the open-source world.


Code: GitHub - pingcap/ossinsight: Analysis, Comparison, Trends, Rankings of Open Source Software, you can also get insight from more than 5 billion with natural language (powered by OpenAI).

Hacker News Insight based on AutoGPT

Desc: The Hacker News Insight app uses AutoGPT to generate accurate and relevant responses to user queries about Hacker News data stored in a TiDB Serverless cloud database. AutoGPT is an experimental open-source application that showcases the capabilities of the GPT-4 language model. By combining AutoGPT with a TiDB Serverless database, the Hacker News Insight app can provide insights and information to users in almost real time. The application shows a new way for users to interact with Hacker News data and demonstrates the potential of AI and machine learning.


Code: GitHub - pingcap/realtime-hackernewsdb-on-serverless: Chat to query Hacker News database, based on Auto-GPT and TiDB Cloud Serverless Database

GitHub repo data pipeline

Desc: Repo-Track-Pipeline is an open-source tool that enables users to collect and analyze repository data from GitHub, with a focus on enabling users to own their data. Users can synchronize their repositories or groups of interesting repositories, keep the collected data in their own database, and have complete control over the data. Repo-Track-Pipeline allows users to perform different analytics on this data, such as analyzing data from a single repository, merging data from multiple repositories, comparing collections of repositories, and detecting fake stars. The tool also supports the synchronization of private repository data to ensure a comprehensive data collection.

Link: GitHub - hooopo/repo-track-pipeline: 🔄 A flexible open-source data pipeline for seamlessly syncing data from any repository to your database.

GitHub user data pipeline

Desc: This GitHub data pipeline is designed to sync all historical data for a user’s GitHub account to the TiDB Serverless tier. The pipeline collects data from various sources such as the user’s repositories, commits, pull requests, issues, and comments, and loads it into a TiDB Serverless cloud database. By syncing all this data into a single, centralized location, users can access it via SQL queries, and use it to generate personalized dashboards, resumes, or digital personas, among other uses. The serverless tier allows users to scale up or down depending on their usage requirements, providing a flexible and cost-effective solution.


Repo: GitHub - hooopo/oh-my-github-pipeline: 🔄 A flexible open-source data pipeline for seamlessly syncing data from any github user to your database.
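
As a rough illustration of the "access it via SQL queries" idea above, here is a minimal sketch of the kind of aggregate a personal dashboard might run over synced data. It uses Python's built-in SQLite as a local stand-in for a TiDB Serverless database, and the `pull_requests` table, its columns, and the sample rows are hypothetical; the real pipeline defines its own schema.

```python
import sqlite3

# In-memory SQLite database used only as a local stand-in for TiDB Serverless.
conn = sqlite3.connect(":memory:")

# Hypothetical table; the real pipeline's schema may look different.
conn.execute(
    "CREATE TABLE pull_requests (repo TEXT, state TEXT, created_at TEXT)"
)
conn.executemany(
    "INSERT INTO pull_requests VALUES (?, ?, ?)",
    [
        ("alice/app", "merged", "2023-01-10"),
        ("alice/app", "closed", "2023-02-02"),
        ("alice/lib", "merged", "2023-03-15"),
        ("alice/lib", "merged", "2023-04-01"),
    ],
)

# Count merged pull requests per repository, most active repository first.
rows = conn.execute(
    "SELECT repo, COUNT(*) AS merged_prs "
    "FROM pull_requests "
    "WHERE state = 'merged' "
    "GROUP BY repo "
    "ORDER BY merged_prs DESC, repo"
).fetchall()

print(rows)  # [('alice/lib', 2), ('alice/app', 1)]
```

A plain `GROUP BY` query like this one should carry over to TiDB's MySQL-compatible SQL, although the connection code would differ.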

Hackernews Insight

Desc: Hackernews Insight is a cloud-based solution for collecting, storing, and visualizing data from Hackernews using TiDB Cloud for database management, Evidence for data visualization, and Vercel for hosting. The solution provides real-time insights into Hackernews data, allowing users to make informed decisions based on the collected data.

Repo: GitHub - hooopo/hackernews-insight: Hackernews Insight using TiDB Cloud


S&P500 Insight based on Vercel & Serverless TiDB & TiDB Data API

Desc: S&P500 Insight, based on Vercel, Serverless TiDB, and the TiDB Data API, is a tool for analyzing the performance of stocks listed on the S&P 500 index. Vercel is a popular cloud platform for hosting and deploying web applications, while Serverless TiDB is the serverless offering of TiDB, an open-source, cloud-native distributed SQL database built for scalability and high availability. The TiDB Data API allows developers to access and manipulate data stored in TiDB. Together, these technologies enable users to quickly and efficiently analyze stock performance data, helping them make informed investment decisions.


Repo: GitHub - pingcap/sp500-insight-template: SP500 Insight vercel template based on TiDB Cloud Data API

Steam Insight - based on Vercel & Serverless TiDB

Desc: Steam Insight is a web application that uses Vercel and Serverless TiDB to help users analyze data related to Steam, a popular online gaming platform. Vercel offers frontend infrastructure, while Serverless TiDB is a distributed SQL database designed for cloud-native applications. Users can access real-time visualization dashboards to gain insights into Steam user activities, game sales, and other related data. Additionally, the TiDB Cloud Data API allows users to perform custom queries on the data. Overall, Steam Insight is a useful tool for gaming industry analysts and data scientists.


Repo: GitHub - pingcap/steam-insight-template: Steam Insight vercel template based on TiDB Cloud Data API

Development Tool

Hybrid Transactional/Analytical Processing Tool


Desc: Metabase is an open-source business intelligence (BI) tool that enables users to easily query and visualize data from various sources, including SQL databases, NoSQL databases, and cloud storage. The platform is intuitive, user-friendly, and requires no coding knowledge, making it accessible to a wide range of users. Metabase offers many features, including dashboards, ad-hoc queries, and reporting, and it integrates seamlessly with tools like Slack and Google Sheets. The platform is highly customizable, extensible, and scalable, making it an ideal choice for businesses of all sizes looking to build a powerful and flexible BI solution.



Desc: Cube.js is a powerful open-source framework that can help developers build Business Intelligence (BI) applications quickly and efficiently. The framework offers a complete toolset for developing data pipelines, allowing for efficient querying, aggregation, and visualization of data from multiple sources. With Cube.js, developers can build data-intensive applications such as dashboards, interactive reports, and data visualizations. The framework integrates with popular BI tools such as Tableau, Power BI, and Looker, making it easier to analyze data from all sources in real-time. Cube.js is highly customizable and flexible, making it an excellent choice for businesses looking to build powerful and scalable BI solutions.



Desc: Grafana is an open-source data analytics and visualization tool that enables users to query, visualize, and alert on data from multiple sources, including databases, cloud services, and IoT devices. The platform has a wide range of features, including interactive dashboards, real-time data streaming, and machine learning-powered alerting, making it an excellent choice for building BI applications. Grafana also has a flexible plugin architecture, which enables developers to extend its functionality to suit their specific data visualization needs. The platform is widely used in industries such as finance, healthcare, and IoT to monitor and analyze data and make data-driven decisions.



Desc: DBeaver is a free, open-source database management tool that provides a universal client for accessing SQL databases. The platform’s intuitive interface and seamless integration with multiple databases, including MySQL, PostgreSQL, Oracle, and Microsoft SQL Server, make it a preferred choice for managing data. With DBeaver, users can execute queries, browse and edit data, and manage database structures, all within a single application. The features include support for multiple tabs, database schema navigation, and database-backed searches. DBeaver also provides data export and import tools, with support for a wide range of formats. Overall, DBeaver is a versatile tool that simplifies database management and streamlines data processing.



Desc: dbt is a popular open-source command-line tool that gives data analysts and engineers an effective way to manage their data pipelines, transforming and organizing their data in a way that is clean, maintainable, and scalable. dbt uses SQL to create reusable, modular transformations that can be easily tested and managed. It also offers version control, automation, and testing capabilities that mirror software engineering practices. dbt’s features, such as modeling as code, incremental modeling, and templating, make it a valuable tool for any data team looking to improve their data transformation process.

Link: GitHub - dbt-labs/dbt-core: dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.


Desc: Airbyte is a free and open-source data integration platform that enables users to move and synchronize data across various systems, applications, and data sources. It offers a range of pre-built connectors to enable easy integration with popular systems such as Salesforce, Stripe, and Shopify, as well as open-source databases like MySQL and PostgreSQL. Airbyte is simple to install and use, and it provides users with real-time data transfer capabilities, data transformation tools, and monitoring and troubleshooting features. Overall, Airbyte is an excellent tool that simplifies data integration workflows and empowers users to extract more value from their data.



Desc: Dagster is a cloud-native data pipeline orchestrator for the whole development lifecycle, with integrated lineage and observability, a declarative programming model, and best-in-class testability.

Link: GitHub - dagster-io/dagster: An orchestration platform for the development, production, and observation of data assets.

Apache ECharts

Desc: Apache ECharts is an open-source visualization library that allows developers to easily create interactive and responsive charts and graphs for web-based analytics applications. It provides a wide range of chart types, including line, bar, scatter, pie, and more. With its powerful API and flexible configuration options, developers can customize the appearance and behavior of charts to suit their specific needs. ECharts supports a variety of data formats and can handle large datasets easily, making it an ideal choice for building analytics applications. Its ease of use and ability to integrate with other libraries and frameworks make it a popular choice among developers.

Link: Apache ECharts

API & Integration Tool

Drivers & ORMs


Desc: Prisma unlocks a new level of developer experience when working with databases thanks to its intuitive data model, automated migrations, type-safety & auto-completion.



Desc: GORM is an ORM library for Golang that aims to be developer friendly.



Desc: TypeORM is an ORM that can run in NodeJS, Browser, Cordova, PhoneGap, Ionic, React Native, NativeScript, Expo, and Electron platforms and can be used with TypeScript and JavaScript (ES5, ES6, ES7, ES8). Its goal is to always support the latest JavaScript features and provide additional features that help you to develop any kind of application that uses databases - from small applications with a few tables to large scale enterprise applications with multiple databases.



Desc: SQLAlchemy is the Python SQL toolkit and Object Relational Mapper that gives application developers the full power and flexibility of SQL.
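
Because TiDB speaks the MySQL wire protocol, an ORM such as SQLAlchemy can usually connect to it with a MySQL-style connection URL. The sketch below only builds such a URL with the standard library; the host, user, and password are hypothetical placeholders, and the `mysql+pymysql` driver choice is an assumption (any MySQL driver supported by SQLAlchemy should work in principle). TLS settings, which a serverless cluster is likely to require, are left out here.

```python
from urllib.parse import quote_plus


def tidb_sqlalchemy_url(user: str, password: str, host: str,
                        port: int = 4000, database: str = "test") -> str:
    """Build a MySQL-style SQLAlchemy URL for a TiDB cluster.

    Port 4000 is TiDB's default SQL port. The "mysql+pymysql" prefix selects
    SQLAlchemy's MySQL dialect with the PyMySQL driver (an assumed choice).
    """
    # Percent-encode credentials so characters such as "@" or "/" in a
    # password do not break URL parsing.
    return (
        f"mysql+pymysql://{quote_plus(user)}:{quote_plus(password)}"
        f"@{host}:{port}/{database}"
    )


# Hypothetical values for illustration only.
url = tidb_sqlalchemy_url("demo.root", "p@ss/word", "gateway.example.com")
print(url)
# mysql+pymysql://demo.root:p%40ss%2Fword@gateway.example.com:4000/test
```

The resulting string is what would be passed to `sqlalchemy.create_engine` once SQLAlchemy and a MySQL driver are installed.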


TiDB Serverless Docs

Desc: help you create/insert/monitor/alert xxxx


Desc: Zapier is an automation platform that makes it easy for developers to build data-intensive apps that integrate with various web applications. With Zapier, developers can create workflows or “Zaps” that move data between different apps and automate routine tasks. Zapier provides developers with ample tools and resources to build their data app, including documentation, testing environments, and support. Once the data app is complete, it can be made available to the Zapier marketplace, where users can leverage its functionality to simplify their work processes. Building a data app on Zapier enables developers to bring disparate systems together and extract value from expansive data sets.



Desc: n8n is an open-source workflow automation tool that enables developers to build data-intensive applications without writing code. It allows users to create workflows out of “nodes” that interconnect different data sources, APIs, and databases. n8n provides developers with numerous pre-built nodes, reusable building blocks that simplify the integration of different services. Additionally, they can create custom nodes based on their particular requirements. Once the workflow is tested and stable, it can be shared with other n8n users or created as a stand-alone application. Building a data app with n8n allows developers to extract value from disparate data sources and automate tasks.



Desc: Vercel is a cloud platform that offers free and easy serverless hosting for websites and web applications. Developers can create and deploy their applications quickly, with no infrastructure to manage. Vercel provides an intuitive, visual interface that simplifies the process of building serverless applications, enabling developers to focus on their code. Vercel’s global content delivery network and automatic scaling help applications stay fast and available under load. With free hosting and pay-as-you-grow pricing, Vercel makes serverless development accessible to all, providing a streamlined path to launching web applications.


Cloudflare Workers

Desc: Cloudflare Workers is a serverless computing platform that allows developers to deploy and run JavaScript code directly on Cloudflare’s global network. Workers can be used to perform a variety of functions, such as handling incoming requests, manipulating responses, and caching dynamic content. Because Workers run on Cloudflare’s network, they can be deployed globally in seconds, with powerful integrations allowing for routing and scripting. Workers can be triggered directly by HTTP requests or scheduled to run at specific times or intervals. By providing a flexible and scalable platform for running code at the edge, Workers can help developers build faster, more secure, and cost-effective applications.


GitHub Actions

Desc: GitHub Actions is a platform for automating workflows and tasks, including integrations with various cloud services. AWS, Google Cloud, and TiDB Cloud are among the many services that developers can integrate with GitHub Actions. This integration enables developers to build, test, and deploy cloud applications directly from their repository. Developers can set up automated deployment pipelines to support continuous integration and deployment. GitHub Actions provides a streamlined and integrated approach to cloud service delivery, allowing developers to boost productivity and reduce time spent on mundane tasks.

Link: Features • GitHub Actions · GitHub

TiDB Cloud Data API Docs
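
As a rough sketch of what calling a Data API endpoint could look like from Python, the example below prepares (but does not send) an HTTPS GET request using only the standard library. It assumes the endpoint is protected with HTTP Digest authentication using a public/private key pair, which should be checked against the Data API docs. The base URL, endpoint path, query parameters, and keys are hypothetical placeholders.

```python
import urllib.request
from urllib.parse import urlencode


def build_opener_with_digest(base_url, public_key, private_key):
    """Return an opener that answers HTTP Digest challenges for base_url."""
    password_mgr = urllib.request.HTTPPasswordMgrWithDefaultRealm()
    # None means "use these credentials for any realm under base_url".
    password_mgr.add_password(None, base_url, public_key, private_key)
    handler = urllib.request.HTTPDigestAuthHandler(password_mgr)
    return urllib.request.build_opener(handler)


def build_endpoint_request(base_url, endpoint_path, params=None):
    """Build a GET request for one endpoint, with optional query parameters."""
    url = f"{base_url.rstrip('/')}/{endpoint_path.lstrip('/')}"
    if params:
        url = f"{url}?{urlencode(params)}"
    return urllib.request.Request(
        url, method="GET", headers={"Accept": "application/json"}
    )


# Hypothetical placeholders; copy the real values from your Data App settings.
BASE_URL = "https://data-api.example.com/app/my-app/endpoint"
request = build_endpoint_request(
    BASE_URL, "/stocks/price", {"symbol": "AAPL", "limit": 10}
)
opener = build_opener_with_digest(BASE_URL, "PUBLIC_KEY", "PRIVATE_KEY")

print(request.get_method(), request.full_url)
# To actually send the request: opener.open(request, timeout=10)
```

Keeping request construction separate from sending makes it easy to log or test the URL before any network call is made.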

AI Tool


Desc: ChatGPT is a large language model developed by OpenAI that provides natural language processing services allowing users to interact with it using text-based input. It has been trained on vast amounts of text data, enabling it to respond to a wide range of questions and requests. ChatGPT is capable of answering questions, generating text, completing sentences, summarizing text, and more. By leveraging advanced natural language processing techniques, ChatGPT enables users to communicate effortlessly and effectively, whether for personal or professional use.



Desc: Auto-GPT is an experimental open-source application showcasing the capabilities of the GPT-4 language model. This program, driven by GPT-4, chains together LLM “thoughts”, to autonomously achieve whatever goal you set. As one of the first examples of GPT-4 running fully autonomously, Auto-GPT pushes the boundaries of what is possible with AI.

Link: GitHub - Significant-Gravitas/Auto-GPT: An experimental open-source attempt to make GPT-4 fully autonomous.


Desc: LangChain is a framework for building language model-powered applications that goes beyond simple API calls. It enables advanced functionality like data-awareness and interaction with the environment, providing developers with the tools to build more powerful and differentiated applications. The LangChain framework is designed to be flexible, scalable, and easy to use, making it an ideal choice for developers looking to incorporate language models into their projects.

Link: GitHub - hwchase17/langchain: ⚡ Building applications with LLMs through composability ⚡


Desc: BabyAGI is a Python script that demonstrates an AI-powered task management system, utilizing OpenAI and Chroma to generate, prioritize, and execute tasks. The script creates tasks based on previous task results and a predefined objective, using NLP capabilities to generate new tasks. It showcases the power of AI when applied to task management.

Link: GitHub - yoheinakajima/babyagi
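
The generate, prioritize, and execute cycle described above can be sketched as a small loop. This is not BabyAGI's actual code: the function names are hypothetical, and the three callables are plain stubs standing in for the LLM calls that the real script would make.

```python
from collections import deque
from typing import Callable, Deque, List


def run_task_loop(
    objective: str,
    first_task: str,
    execute: Callable[[str, str], str],
    create_tasks: Callable[[str, str, str], List[str]],
    prioritize: Callable[[str, List[str]], List[str]],
    max_steps: int = 5,
) -> List[str]:
    """Run an execute -> create -> prioritize loop and return all results."""
    queue: Deque[str] = deque([first_task])
    results: List[str] = []
    for _ in range(max_steps):  # hard cap so the loop always terminates
        if not queue:
            break
        task = queue.popleft()
        result = execute(objective, task)
        results.append(result)
        # New tasks are derived from the objective and the latest result.
        new_tasks = create_tasks(objective, task, result)
        # Re-rank everything that is still pending.
        queue = deque(prioritize(objective, list(queue) + new_tasks))
    return results


# Stub callables standing in for LLM-backed steps (hypothetical behavior).
def fake_execute(objective: str, task: str) -> str:
    return f"done: {task}"


def fake_create_tasks(objective: str, task: str, result: str) -> List[str]:
    # Only the first task spawns a follow-up, so the queue drains.
    return ["summarize findings"] if task == "research topic" else []


def fake_prioritize(objective: str, tasks: List[str]) -> List[str]:
    return sorted(tasks)


history = run_task_loop(
    "write a report", "research topic",
    fake_execute, fake_create_tasks, fake_prioritize,
)
print(history)  # ['done: research topic', 'done: summarize findings']
```

The `max_steps` cap is a design choice for the sketch: with real LLM calls, task creation may never run dry, so some budget on iterations is usually needed.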


Desc: Assemble, configure, and deploy autonomous AI Agents in your browser.

Link: GitHub - reworkd/AgentGPT: 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.


Desc: LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data.

Link: GitHub - jerryjliu/llama_index: LlamaIndex (GPT Index) is a data framework for your LLM applications


Desc: Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio, including music, background noise, and simple sound effects. The model can also produce nonverbal communications like laughing, sighing, and crying. To support the research community, Suno provides access to pretrained model checkpoints ready for inference.

Link: GitHub - suno-ai/bark: 🔊 Text-Prompted Generative Audio Model

ChatBot UI

Desc: An open source ChatGPT UI.

Link: GitHub - mckaywrigley/chatbot-ui: An open source ChatGPT UI.

If you have any thoughts or discussions you would like to share, feel free to join us on Discord.