Development Tools - TiDB Future App Hackathon 2024

Tools from our Sponsors!

Anyscale

The Anyscale Platform offers key advantages over open-source Ray. It provides a seamless user experience for developers and AI teams to speed up development and deploy AI/ML workloads at scale. Companies using Anyscale benefit from rapid time-to-market and faster iterations across the entire AI lifecycle.

Getting started: Anyscale

Repository: GitHub - ray-project/ray: Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Dify

Dify is an intuitive genAI application development platform for everyone. Dify combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more on a visual interface, letting builders quickly go from prototype to production.

Getting started:

Repository: GitHub - langgenius/dify: Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Jina.ai

At Jina AI, we are building the search foundation, which consists of embeddings, reranker, prompt ops, and core infrastructure. These components work in concert to revolutionize how we search and understand data, thereby increasing relevance and reducing search time.

Getting started:

HuggingFace link: jinaai (Jina AI)

Integrate TiDB Vector Search with Jina AI Embedding: Integrate TiDB Vector Search with Jina AI Embeddings API | PingCAP Docs
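For anyone new to embeddings, here is a rough idea of what a vector search does with them: every text is turned into a vector, and the rows closest to the query vector are returned first. The sketch below is plain Python with no Jina AI or TiDB calls. The three-dimensional vectors, the `DOCS` table, and the `top_k` helper are made-up assumptions for illustration; real embeddings have hundreds of dimensions and come from an embedding model.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity; a smaller value means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def top_k(query_vec, docs, k=2):
    """Rank document ids by cosine distance to the query (hypothetical helper)."""
    ranked = sorted(docs.items(), key=lambda item: cosine_distance(query_vec, item[1]))
    return [doc_id for doc_id, _ in ranked[:k]]


# Toy "embeddings" (assumed values, not produced by any real model).
DOCS = {
    "dog": [1.0, 0.9, 0.1],
    "fish": [0.1, 0.2, 1.0],
    "tree": [0.0, 1.0, 0.0],
}
QUERY = [0.9, 1.0, 0.0]

print(top_k(QUERY, DOCS, k=2))
```

In a real project the vectors would be stored in a vector column and the distance computed by the database; the linked PingCAP doc covers that integration.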

Lepton AI

Lepton AI offers production-ready serverless APIs and dedicated deployments, enabling rapid and efficient scaling of AI applications for developers and enterprises.

Getting started:

Repository: GitHub - leptonai/leptonai: A Pythonic framework to simplify AI service building

LlamaIndex

LlamaIndex is a framework in Python and TypeScript for connecting your data to LLMs. It offers a quick, easy way to build generative AI applications including ways to load data from your database or file system, tools to index and embed that data, and engines for querying that data. You can quickly create chatbots, agents and more.

Getting started: Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.

Repositories: GitHub - run-llama/llama_index: LlamaIndex is a data framework for your LLM applications and GitHub - run-llama/LlamaIndexTS: LlamaIndex in TypeScript

NPi

NPi AI offers a unified LLM function-calling framework and platform. With NPi, AI agents can use API or browser tools to operate and interact with various software and applications.

Getting Started:

Repository:

Hybrid Transactional/Analytical Processing Tool

Metabase

Desc: Metabase is an open-source business intelligence (BI) tool that enables users to easily query and visualize data from various sources, including SQL databases, NoSQL databases, and cloud storage. The platform is intuitive, user-friendly, and requires no coding knowledge, making it accessible to a wide range of users. Metabase offers many features, including dashboards, ad-hoc queries, and reporting, and it integrates seamlessly with tools like Slack and Google Sheets. The platform is highly customizable, extensible, and scalable, making it an ideal choice for businesses of all sizes looking to build a powerful and flexible BI solution.

Link: https://www.metabase.com/

Cube.js

Desc: Cube.js is a powerful open-source framework that can help developers build Business Intelligence (BI) applications quickly and efficiently. The framework offers a complete toolset for developing data pipelines, allowing for efficient querying, aggregation, and visualization of data from multiple sources. With Cube.js, developers can build data-intensive applications such as dashboards, interactive reports, and data visualizations. The framework integrates with popular BI tools such as Tableau, Power BI, and Looker, making it easier to analyze data from all sources in real-time. Cube.js is highly customizable and flexible, making it an excellent choice for businesses looking to build powerful and scalable BI solutions.

Link: https://cube.dev/

Grafana

Desc: Grafana is an open-source data analytics and visualization tool that enables users to query, visualize, and alert on data from multiple sources, including databases, cloud services, and IoT devices. The platform has a wide range of features, including interactive dashboards, real-time data streaming, and machine learning-powered alerting, making it an excellent choice for building BI applications. Grafana also has a flexible plugin architecture, which enables developers to extend its functionality to suit their specific data visualization needs. The platform is widely used in industries such as finance, healthcare, and IoT to monitor and analyze data and make data-driven decisions.

Link: https://grafana.com/

DBeaver

Desc: DBeaver is a free, open-source database management tool that provides a universal client for accessing SQL databases. The platform’s intuitive interface and seamless integration with multiple databases, including MySQL, PostgreSQL, Oracle, and Microsoft SQL Server, make it a preferred choice for managing data. With DBeaver, users can execute queries, browse and edit data, and manage database structures, all within a single application. The features include support for multiple tabs, database schema navigation, and database-backed searches. DBeaver also provides data export and import tools, with support for a wide range of formats. Overall, DBeaver is a versatile tool that simplifies database management and streamlines data processing.

Link: https://dbeaver.io/

dbt

Desc: dbt is a popular open-source command-line tool that gives data analysts and engineers an effective way to manage their data pipelines, transforming and organizing data so that it is clean, maintainable, and scalable. dbt uses SQL to create reusable, modular transformations that can be easily tested and managed. It also offers version control, automation, and testing capabilities that mirror software engineering practices. dbt's features, such as modeling as code, incremental modeling, and templating, make it a strong choice for any data team looking to improve its data transformation process. Overall, dbt streamlines data pipeline management, making it more manageable and efficient.

Link: GitHub - dbt-labs/dbt-core: dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
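To make "modular transformations that can be tested" a bit more concrete, here is a minimal sketch of the idea. It does not use dbt itself: it uses Python's built-in `sqlite3` as a stand-in database, and the table and view names (`raw_orders`, `stg_orders`, `customer_totals`) and the `failing_rows` helper are hypothetical. Each view plays the role of one model built on the previous one, and the helper mimics a simple not-null test.

```python
import sqlite3


def build_models(conn):
    """Create a raw table plus two layered views, each acting like one 'model'."""
    conn.executescript(
        """
        CREATE TABLE raw_orders (id INTEGER, customer TEXT, amount_cents INTEGER);
        INSERT INTO raw_orders VALUES (1, 'ada', 1250), (2, 'bob', 400), (3, 'ada', 750);

        -- Staging model: clean up units once, reuse everywhere downstream.
        CREATE VIEW stg_orders AS
            SELECT id, customer, amount_cents / 100.0 AS amount FROM raw_orders;

        -- Downstream model: built only from the staging model.
        CREATE VIEW customer_totals AS
            SELECT customer, SUM(amount) AS total FROM stg_orders GROUP BY customer;
        """
    )


def failing_rows(conn, model, column):
    """Not-null style test: count rows in `model` where `column` is NULL."""
    query = f"SELECT COUNT(*) FROM {model} WHERE {column} IS NULL"
    return conn.execute(query).fetchone()[0]


conn = sqlite3.connect(":memory:")
build_models(conn)
totals = dict(conn.execute("SELECT customer, total FROM customer_totals ORDER BY customer"))
print(totals)
print(failing_rows(conn, "customer_totals", "total"))
```

In dbt, each of these SELECT statements would live in its own model file and the test would be declared in configuration rather than written by hand.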

Airbyte

Desc: Airbyte is a free and open-source data integration platform that enables users to move and synchronize data across various systems, applications, and data sources. It offers a range of pre-built connectors to enable easy integration with popular systems such as Salesforce, Stripe, and Shopify, as well as open-source databases like MySQL and PostgreSQL. Airbyte is simple to install and use, and it provides users with real-time data transfer capabilities, data transformation tools, and monitoring and troubleshooting features. Overall, Airbyte is an excellent tool that simplifies data integration workflows and empowers users to extract more value from their data.

Link: https://airbyte.com/

Dagster

Desc: Dagster is a cloud-native data pipeline orchestrator for the whole development lifecycle, with integrated lineage and observability, a declarative programming model, and best-in-class testability.

Link: GitHub - dagster-io/dagster: An orchestration platform for the development, production, and observation of data assets.

Apache ECharts

Desc: Apache ECharts is an open-source visualization library that allows developers to easily create interactive and responsive charts and graphs for web-based analytics applications. It provides a wide range of chart types, including line, bar, scatter, pie, and more. With its powerful API and flexible configuration options, developers can customize the appearance and behavior of charts to suit their specific needs. ECharts supports a variety of data formats and can handle large datasets easily, making it an ideal choice for building analytics applications. Its ease of use and ability to integrate with other libraries and frameworks make it a popular choice among developers.

Link: Apache ECharts

API & Integration Tool

Drivers & ORMs

Prisma

Desc: Prisma unlocks a new level of developer experience when working with databases thanks to its intuitive data model, automated migrations, type-safety & auto-completion.

Link: https://www.prisma.io/

GORM

Desc: GORM is an ORM library for Golang that aims to be developer friendly.

Link: https://gorm.io/

TypeORM

Desc: TypeORM is an ORM that can run in NodeJS, Browser, Cordova, PhoneGap, Ionic, React Native, NativeScript, Expo, and Electron platforms and can be used with TypeScript and JavaScript (ES5, ES6, ES7, ES8). Its goal is to always support the latest JavaScript features and provide additional features that help you to develop any kind of application that uses databases - from small applications with a few tables to large scale enterprise applications with multiple databases.

Link: https://typeorm.io/

SQLAlchemy

Desc: SQLAlchemy is the Python SQL toolkit and Object Relational Mapper that gives application developers the full power and flexibility of SQL.

Link: https://www.sqlalchemy.org/

Zapier

Desc: Zapier is an automation platform that makes it easy for developers to build data-intensive apps that integrate with various web applications. With Zapier, developers can create workflows or “Zaps” that move data between different apps and automate routine tasks. Zapier provides developers with ample tools and resources to build their data app, including documentation, testing environments, and support. Once the data app is complete, it can be made available to the Zapier marketplace, where users can leverage its functionality to simplify their work processes. Building a data app on Zapier enables developers to bring disparate systems together and extract value from expansive data sets.

Link: https://zapier.com/

n8n

Desc: n8n is an open-source workflow automation tool that enables developers to build data-intensive applications without writing code. It allows users to create workflows out of “nodes” that interconnect different data sources, APIs, and databases. n8n provides developers with numerous pre-built nodes, reusable building blocks that simplify the integration of different services. Developers can also create custom nodes for their particular requirements. Once a workflow is tested and stable, it can be shared with other n8n users or created as a stand-alone application. Building a data app with n8n allows developers to extract value from disparate data sources and automate tasks without writing code.

Link: https://n8n.io/

Vercel

Desc: Vercel is a cloud platform that offers free and easy serverless hosting for websites and web applications. Developers can create and deploy their applications in a matter of seconds, with no infrastructure to manage. Vercel provides an intuitive, visual interface that simplifies the process of building serverless applications, enabling developers to focus on their code. Vercel’s global content delivery network and automatic scaling ensure that applications always perform optimally, with no downtime. With free hosting and pay-as-you-grow pricing, Vercel makes serverless development accessible to all, providing a streamlined and hassle-free path to launching web applications.

Link: https://vercel.com/

Cloudflare Workers

Desc: Cloudflare Workers is a serverless computing platform that allows developers to deploy and run JavaScript code directly on Cloudflare’s global network. Workers can be used to perform a variety of functions, such as handling incoming requests, manipulating responses, and caching dynamic content. Because Workers run on Cloudflare’s network, they can be deployed globally in seconds, with powerful integrations allowing for routing and scripting. Workers can be triggered directly by HTTP requests or scheduled to run at specific times or intervals. By providing a flexible and scalable platform for running code at the edge, Workers can help developers build faster, more secure, and cost-effective applications.

Link: https://workers.cloudflare.com/

GitHub Actions

Desc: GitHub Actions is a platform for automating workflows and tasks, including integrations with various cloud services. AWS, Google Cloud, and TiDB Cloud are among the many services that developers can integrate with GitHub Actions. This integration enables developers to build, test, and deploy cloud applications directly from their repository. Automated deployment pipelines let developers ship changes quickly and strengthen their continuous integration and deployment process. GitHub Actions provides a streamlined and integrated approach to cloud service delivery, allowing developers to boost productivity and reduce time spent on mundane tasks.

Link: Features • GitHub Actions · GitHub

AI Tool

ChatGPT

Desc: ChatGPT is a large language model developed by OpenAI that provides natural language processing services allowing users to interact with it using text-based input. It has been trained on vast amounts of text data, enabling it to respond to a wide range of questions and requests. ChatGPT is capable of answering questions, generating text, completing sentences, summarizing text, and more. By leveraging advanced natural language processing techniques, ChatGPT enables users to communicate effortlessly and effectively, whether for personal or professional use.

Link: https://chat.openai.com/

Gemini AI

Desc: Google Gemini is a multimodal artificial intelligence (AI) model developed by Google, designed to understand and generate text, images, video, and audio. The Gemini chatbot was initially known as Google Bard, and Gemini has been integrated into various Google services and products, including Google Search, Google Pixel devices, and Google Workspace.

Link: https://gemini.google.com/

AWS Bedrock

Desc: Amazon Bedrock is a fully managed service provided by Amazon Web Services (AWS) that offers access to a variety of high-performing foundation models (FMs) from leading AI companies and Amazon itself. These models can be used to build and scale generative AI applications securely and privately.

Link: Build Generative AI Applications with Foundation Models – Amazon Bedrock – AWS

Claude AI

Desc: Claude AI, developed by Anthropic, is an advanced conversational AI assistant designed to perform a variety of language tasks such as text generation, summarization, analysis, and coding. Emphasizing safety, transparency, and alignment with human values, Claude can process up to 100,000 tokens at once, making it capable of understanding long-form content. It also supports vision analysis, multilingual processing, and code generation. Compared to ChatGPT, Claude offers enhanced reasoning capabilities and a stronger focus on ethical AI practices, positioning it as a trustworthy and capable alternative for complex cognitive tasks.

Link: https://claude.ai/

AutoGPT

Desc: Auto-GPT is an experimental open-source application showcasing the capabilities of the GPT-4 language model. This program, driven by GPT-4, chains together LLM “thoughts” to autonomously achieve whatever goal you set. As one of the first examples of GPT-4 running fully autonomously, Auto-GPT pushes the boundaries of what is possible with AI.

Link: GitHub - Significant-Gravitas/AutoGPT: AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

LangChain

Desc: LangChain is a framework for building language model-powered applications that goes beyond simple API calls. It enables advanced functionality like data-awareness and interaction with the environment, providing developers with the tools to build more powerful and differentiated applications. The LangChain framework is designed to be flexible, scalable, and easy to use, making it an ideal choice for developers looking to incorporate language models into their projects.

Link: GitHub - langchain-ai/langchain: Building applications with LLMs through composability

BabyAGI

Desc: BabyAGI is a Python script that demonstrates an AI-powered task management system, utilizing OpenAI and Chroma to generate, prioritize, and execute tasks. The script creates tasks based on previous task results and a predefined objective, using NLP capabilities to generate new tasks. It showcases the power of AI when applied to task management.

Link: GitHub - yoheinakajima/babyagi
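The generate, prioritize, execute loop described above can be sketched roughly like this. This is not BabyAGI's actual code: the OpenAI and Chroma calls are replaced with trivial stand-in functions, and every name here (`execute_task`, `create_tasks`, `prioritize`, `run`) is a hypothetical placeholder so the loop can run without any API key.

```python
from collections import deque


def execute_task(objective, task):
    """Stand-in for an LLM call that would carry out the task."""
    return f"result of '{task}'"


def create_tasks(objective, task, result, depth):
    """Stand-in for an LLM proposing follow-up tasks; stops after two levels."""
    if depth >= 2:
        return []
    return [f"{task} / follow-up {i}" for i in (1, 2)]


def prioritize(queue):
    """Stand-in for LLM-based prioritization: shortest description first."""
    return deque(sorted(queue, key=lambda item: len(item[0])))


def run(objective, first_task, max_steps=5):
    """Execute tasks one by one, feeding each result back into task creation."""
    queue = deque([(first_task, 0)])
    log = []
    while queue and len(log) < max_steps:
        task, depth = queue.popleft()
        result = execute_task(objective, task)
        log.append((task, result))
        new_tasks = create_tasks(objective, task, result, depth)
        queue.extend((new_task, depth + 1) for new_task in new_tasks)
        queue = prioritize(queue)
    return log


for task, result in run("write a report", "outline"):
    print(task, "->", result)
```

The `max_steps` cap is there because a loop that keeps generating its own work needs some stopping rule; with real LLM calls it also keeps API costs bounded.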

AgentGPT

Desc: Assemble, configure, and deploy autonomous AI Agents in your browser.

Link: GitHub - reworkd/AgentGPT: Assemble, configure, and deploy autonomous AI Agents in your browser.

Bark

Desc: Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio, including music, background noise, and simple sound effects. The model can also produce nonverbal communications like laughing, sighing, and crying. To support the research community, Suno provides access to pretrained model checkpoints ready for inference.

Link: GitHub - suno-ai/bark: Text-Prompted Generative Audio Model

ChatBot UI

Desc: An open source ChatGPT UI.

Link: GitHub - mckaywrigley/chatbot-ui: An open source ChatGPT UI.
