Skip to content

Commit 10d8f6a

Browse files
committed
Update
1 parent b373d7f commit 10d8f6a

File tree

7 files changed

+41
-5
lines changed

7 files changed

+41
-5
lines changed

_posts/papers/2023-07-03-2307.01026.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,9 @@ tags:
88
- Neural Information Processing Systems
99
link: https://arxiv.org/abs/2307.01026
1010
author: Shenyang Huang
11-
categories: Publications
11+
categories:
12+
- Publications
13+
- TGB
1214

1315
---
1416

_posts/papers/2024-06-14-2406.09639.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,9 @@ tags:
99
- arXiv.org
1010
link: https://arxiv.org/abs/2406.09639
1111
author: Shenyang Huang
12-
categories: Publications
12+
categories:
13+
- Publications
14+
- TGB
1315

1416
---
1517

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,29 @@
1+
---
2+
title: 'BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal
3+
Models on Document and Code Tasks'
4+
venue: ''
5+
names: Juan Rodriguez, Xiangru Jian, Siba Smarak Panigrahi, Tianyu Zhang, Aarash Feizi,
6+
Abhay Puri, Akshay Kalkunte, Franccois Savard, Ahmed Masry, Shravan Nayak, Rabiul
7+
Awal, Mahsa Massoud, Amirhossein Abaskohi, Zichao Li, Suyuchen Wang, Pierre-Andre
8+
Noel, M. L. Richter, Saverio Vadacchino, Shubbam Agarwal, Sanket Biswas, Sara Shanian,
9+
Ying Zhang, Noah Bolger, Kurt MacDonald, Simon Fauvel, Sathwik Tejaswi, Srinivas
10+
Sunkara, João Monteiro, K. Dvijotham, Torsten Scholak, Nicolas Chapados, Sepideh
11+
Kharagani, Sean Hughes, M. Ozsu, Siva Reddy, M. Pedersoli, Y. Bengio, Christopher
12+
Pal, I. Laradji, Spandanna Gella, Perouz Taslakian, David Vázquez, Sai Rajeswar
13+
tags:
14+
- ''
15+
link: https://arxiv.org/abs/2412.04626
16+
author: Aarash Feizi
17+
categories: Publications
18+
19+
---
20+
21+
*{{ page.names }}*
22+
23+
**{{ page.venue }}**
24+
25+
{% include display-publication-links.html pub=page %}
26+
27+
## Abstract
28+
29+
Multimodal AI has the potential to significantly enhance document-understanding tasks, such as processing receipts, understanding workflows, extracting data from documents, and summarizing reports. Code generation tasks that require long-structured outputs can also be enhanced by multimodality. Despite this, their use in commercial applications is often limited due to limited access to training data and restrictive licensing, which hinders open access. To address these limitations, we introduce BigDocs-7.5M, a high-quality, open-access dataset comprising 7.5 million multimodal documents across 30 tasks. We use an efficient data curation process to ensure our data is high-quality and license-permissive. Our process emphasizes accountability, responsibility, and transparency through filtering rules, traceable metadata, and careful content analysis. Additionally, we introduce BigDocs-Bench, a benchmark suite with 10 novel tasks where we create datasets that reflect real-world use cases involving reasoning over Graphical User Interfaces (GUI) and code generation from images. Our experiments show that training with BigDocs-Bench improves average performance up to 25.8% over closed-source GPT-4o in document reasoning and structured output tasks such as Screenshot2HTML or Image2Latex generation. Finally, human evaluations showed a preference for outputs from models trained on BigDocs over GPT-4o. This suggests that BigDocs can help both academics and the open-source community utilize and improve AI tools to enhance multimodal capabilities and document reasoning. The project is hosted at https://bigdocs.github.io .

_research_directions/temporal-graph-learning.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ header:
88
overlay_image: /assets/images/research_directions/temporal-graph-learning/banner.webp
99
logo_image_path: /assets/images/home/TGL_logo-light.png
1010
logo_dark_image_path: /assets/images/home/TGL_logo-dark.png
11-
one-liner: "Advancing the frontier of machine learning on time-evolving graphs to better model and predict dynamic real-world networks and relationships."
11+
one-liner: "When networks breathe, pulse, and transform, what fundamental truths about complexity and connection can machine learning unveil?"
1212
excerpt: "Our team brings together experts from fields like AI, data mining, and public health to improve how we analyze networks that change over time. We’re focused on creating better ways to predict outcomes and make decisions, especially in areas like fraud detection and disease tracking."
1313

1414
---

_research_directions/temporal-graph-learning/tga.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,9 @@ layout: splash
44
category: temporal-graph-learning
55
order: 2
66
header:
7-
overlay_filter: linear-gradient(rgba(255, 255, 255, 0.1), rgba(0, 0, 0, 0.5))
7+
overlay_filter:
88
overlay_image: /assets/images/research_directions/temporal-graph-learning/TGA.png
9+
overlay_size: "contain"
910
excerpt: ""
1011

1112
---

_research_directions/temporal-graph-learning/tgb.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,8 @@ header:
99
overlay_size: "contain"
1010
excerpt: ""
1111

12-
redirect_to: https://tgb.complexdatalab.com/
12+
1313
---
1414

15+
TGB is a collection of challenging and diverse benchmark datasets for realistic, reproducible, and robust evaluation for machine learning on temporal graphs. TGB includes both dynamic link and node property prediction tasks and an automated pipeline from dataset downloading, dataloading, evaluation and submission to the TGB leaderboard. Learn more [here](https://tgb.complexdatalab.com/)
1516

records/semantic_paper_ids_ignored.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,7 @@
99
"056a2ab1b2bb30bc8099660dfe5131e9660d1b90",
1010
"05d16d7f6dc2fbb571ae3ff7d8145f01c17443e1",
1111
"05d76a00dd42fd5e75a7222d4479d2a35608c7a3",
12+
"05fac44a97728b033f2bbf32f273a8b20b5dce40",
1213
"06e81ae4a28d9c4c6e79e1ddff0c485a0a178ff1",
1314
"09a34aad92e6f416c81f47e60de0809616b49cce",
1415
"0b483b550b21ec42d693fc04a372dbb10dd07019",

0 commit comments

Comments
 (0)