Search Results for author: Chun-Liang Li

Found 54 papers, 26 papers with code

CodecLM: Aligning Language Models with Tailored Synthetic Data

no code implementations • 8 Apr 2024 • Zifeng Wang, Chun-Liang Li, Vincent Perot, Long T. Le, Jin Miao, Zizhao Zhang, Chen-Yu Lee, Tomas Pfister

To this end, we introduce CodecLM, a general framework for adaptively generating high-quality synthetic data for LLM alignment with different downstream instruction distributions and LLMs.

Instruction Following

Paper
Add Code

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

no code implementations • 9 Jan 2024 • Zilong Wang, Hao Zhang, Chun-Liang Li, Julian Martin Eisenschlos, Vincent Perot, Zifeng Wang, Lesly Miculicich, Yasuhisa Fujii, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

We propose the Chain-of-Table framework, where tabular data is explicitly used in the reasoning chain as a proxy for intermediate thoughts.

Ranked #3 on Table-based Fact Verification on TabFact

Fact Verification In-Context Learning +3

Paper
Add Code

Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models

no code implementations • 1 Aug 2023 • Cheng-Yu Hsieh, Si-An Chen, Chun-Liang Li, Yasuhisa Fujii, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister

Today, large language models (LLMs) are taught to use new tools by providing a few demonstrations of the tool's usage.

Image Generation

Paper
Add Code

Re-Benchmarking Pool-Based Active Learning for Binary Classification

1 code implementation • 15 Jun 2023 • Po-Yi Lu, Chun-Liang Li, Hsuan-Tien Lin

Active learning is a paradigm that significantly enhances the performance of machine learning models when acquiring labeled data is expensive.

Active Learning Benchmarking +2

Paper
Code

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

no code implementations • 4 May 2023 • Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Timothy Dozat, Vincent Perot, Guolong Su, Xiang Zhang, Kihyuk Sohn, Nikolai Glushnev, Renshen Wang, Joshua Ainslie, Shangbang Long, Siyang Qin, Yasuhisa Fujii, Nan Hua, Tomas Pfister

In FormNetV2, we introduce a centralized multimodal graph contrastive learning strategy to unify self-supervised pre-training for all modalities in one loss.

Contrastive Learning document understanding +1

Paper
Add Code

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

1 code implementation • 3 May 2023 • Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alexander Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister

Third, we reduce both the model size and the amount of data required to outperform LLMs; our finetuned 770M T5 model outperforms the few-shot prompted 540B PaLM model using only 80% of available data on a benchmark, whereas standard finetuning the same T5 model struggles to match even by using 100% of the dataset.

343

Paper
Code

TSMixer: An All-MLP Architecture for Time Series Forecasting

2 code implementations • 10 Mar 2023 • Si-An Chen, Chun-Liang Li, Nate Yoder, Sercan O. Arik, Tomas Pfister

Extending them, in this paper, we investigate the capabilities of linear models for time-series forecasting and present Time-Series Mixer (TSMixer), a novel architecture designed by stacking multi-layer perceptrons (MLPs).

Time Series Time Series Forecasting

32,952

Paper
Code

Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval

1 code implementation • CVPR 2023 • Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister

Existing methods rely on supervised learning of CIR models using labeled triplets consisting of the query image, text specification, and the target image.

Ranked #1 on Zero-shot Image Retrieval on ImageNet-R

Attribute Retrieval +2

139

Paper
Code

Neural Spline Search for Quantile Probabilistic Modeling

no code implementations • 12 Jan 2023 • Ruoxi Sun, Chun-Liang Li, Sercan O. Arik, Michael W. Dusenberry, Chen-Yu Lee, Tomas Pfister

Accurate estimation of output quantiles is crucial in many use cases, where it is desired to model the range of possibility.

Attribute regression +2

Paper
Add Code

Hyperbolic Contrastive Learning for Visual Representations beyond Objects

1 code implementation • CVPR 2023 • Songwei Ge, Shlok Mishra, Simon Kornblith, Chun-Liang Li, David Jacobs

To exploit such a structure, we propose a contrastive learning framework where a Euclidean loss is used to learn object representations and a hyperbolic loss is used to encourage representations of scenes to lie close to representations of their constituent objects in a hyperbolic space.

Contrastive Learning Image Classification +5

Paper
Code

SPADE: Semi-supervised Anomaly Detection under Distribution Mismatch

no code implementations • 30 Nov 2022 • Jinsung Yoon, Kihyuk Sohn, Chun-Liang Li, Sercan O. Arik, Tomas Pfister

Semi-supervised anomaly detection is a common problem, as often the datasets containing anomalies are partially labeled.

Semi-supervised Anomaly Detection Supervised Anomaly Detection

Paper
Add Code

Prefix Conditioning Unifies Language and Label Supervision

no code implementations • CVPR 2023 • Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister

In experiments, we show that this simple technique improves the performance in zero-shot image recognition accuracy and robustness to the image-level distribution shift.

Classification Contrastive Learning +2

Paper
Add Code

Learning Instance-Specific Adaptation for Cross-Domain Segmentation

no code implementations • 30 Mar 2022 • Yuliang Zou, Zizhao Zhang, Chun-Liang Li, Han Zhang, Tomas Pfister, Jia-Bin Huang

We propose a test-time adaptation method for cross-domain image segmentation.

Data Augmentation Domain Generalization +5

Paper
Add Code

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

no code implementations • ACL 2022 • Chen-Yu Lee, Chun-Liang Li, Timothy Dozat, Vincent Perot, Guolong Su, Nan Hua, Joshua Ainslie, Renshen Wang, Yasuhisa Fujii, Tomas Pfister

Sequence modeling has demonstrated state-of-the-art performance on natural language and document understanding tasks.

document understanding

Paper
Add Code

Decoupling Local and Global Representations of Time Series

1 code implementation • 4 Feb 2022 • Sana Tonekaboni, Chun-Liang Li, Sercan Arik, Anna Goldenberg, Tomas Pfister

Learning representations that capture the factors contributing to this variability enables a better understanding of the data via its underlying generative process and improves performance on downstream machine learning tasks.

counterfactual Time Series +1

Paper
Code

Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types

2 code implementations • 21 Dec 2021 • Kihyuk Sohn, Jinsung Yoon, Chun-Liang Li, Chen-Yu Lee, Tomas Pfister

We define a distance function between images, each of which is represented as a bag of embeddings, by the Euclidean distance between weighted averaged embeddings.

Anomaly Detection Clustering +2

Paper
Code

Improving Model Compatibility of Generative Adversarial Networks by Boundary Calibration

no code implementations • 3 Nov 2021 • Si-An Chen, Chun-Liang Li, Hsuan-Tien Lin

To improve GAN in terms of model compatibility, we propose Boundary-Calibration GANs (BCGANs), which leverage the boundary information from a set of pre-trained classifiers using the original data.

Paper
Add Code

A Unified View of cGANs with and without Classifiers

1 code implementation • NeurIPS 2021 • Si-An Chen, Chun-Liang Li, Hsuan-Tien Lin

Conditional Generative Adversarial Networks (cGANs) are implicit generative models which allow to sample from class-conditional distributions.

Paper
Code

Robust Contrastive Learning Using Negative Samples with Diminished Semantics

1 code implementation • NeurIPS 2021 • Songwei Ge, Shlok Mishra, Haohan Wang, Chun-Liang Li, David Jacobs

We also show that model bias favors texture and shape features differently under different test settings.

Contrastive Learning Data Augmentation +1

Paper
Code

Unifying Distribution Alignment as a Loss for Imbalanced Semi-supervised Learning

no code implementations • 29 Sep 2021 • Justin Lazarow, Kihyuk Sohn, Chun-Liang Li, Zizhao Zhang, Chen-Yu Lee, Tomas Pfister

While remarkable progress in imbalanced supervised learning has been made recently, less attention has been given to the setting of imbalanced semi-supervised learning (SSL) where not only is a few labeled data provided, but the underlying data distribution can be severely imbalanced.

Pseudo Label

Paper
Add Code

Object-aware Contrastive Learning for Debiased Scene Representation

1 code implementation • NeurIPS 2021 • Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn, Chun-Liang Li, Jinwoo Shin

Contrastive self-supervised learning has shown impressive results in learning visual representations from unlabeled images by enforcing invariance against different data augmentations.

Contrastive Learning Object +2

Paper
Code

ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

no code implementations • ACL 2021 • Chen-Yu Lee, Chun-Liang Li, Chu Wang, Renshen Wang, Yasuhisa Fujii, Siyang Qin, Ashok Popat, Tomas Pfister

Natural reading orders of words are crucial for information extraction from form-like documents.

Paper
Add Code

Self-supervise, Refine, Repeat: Improving Unsupervised Anomaly Detection

no code implementations • 11 Jun 2021 • Jinsung Yoon, Kihyuk Sohn, Chun-Liang Li, Sercan O. Arik, Chen-Yu Lee, Tomas Pfister

We demonstrate our method on various unsupervised AD tasks with image and tabular data.

Classification One-Class Classification +3

Paper
Add Code

DISSECT: Disentangled Simultaneous Explanations via Concept Traversals

1 code implementation • ICLR 2022 • Asma Ghandeharioun, Been Kim, Chun-Liang Li, Brendan Jou, Brian Eoff, Rosalind W. Picard

Explaining deep learning model inferences is a promising venue for scientific understanding, improving safety, uncovering hidden biases, evaluating fairness, and beyond, as argued by many scholars.

counterfactual Fairness +2

Paper
Code

CutPaste: Self-Supervised Learning for Anomaly Detection and Localization

2 code implementations • CVPR 2021 • Chun-Liang Li, Kihyuk Sohn, Jinsung Yoon, Tomas Pfister

We aim at constructing a high performance model for defect detection that detects unknown anomalous patterns of an image without anomalous data.

Ranked #55 on Anomaly Detection on MVTec AD

Data Augmentation Defect Detection +4

222

Paper
Code

Learning and Evaluating Representations for Deep One-class Classification

1 code implementation • ICLR 2021 • Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon, Minho Jin, Tomas Pfister

We first learn self-supervised representations from one-class data, and then build one-class classifiers on learned representations.

Ranked #7 on Anomaly Detection on One-class CIFAR-100

Classification Contrastive Learning +6

151

Paper
Code

PseudoSeg: Designing Pseudo Labels for Semantic Segmentation

2 code implementations • ICLR 2021 • Yuliang Zou, Zizhao Zhang, Han Zhang, Chun-Liang Li, Xiao Bian, Jia-Bin Huang, Tomas Pfister

We demonstrate the effectiveness of the proposed pseudo-labeling strategy in both low-data and high-data regimes.

Ranked #5 on Semi-Supervised Semantic Segmentation on COCO 1/32 labeled

Data Augmentation Image Classification +2

162

Paper
Code

i-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

3 code implementations • ICLR 2021 • Kibok Lee, Yian Zhu, Kihyuk Sohn, Chun-Liang Li, Jinwoo Shin, Honglak Lee

Contrastive representation learning has shown to be effective to learn representations from unlabeled data.

Contrastive Learning Representation Learning

Paper
Code

Interpretable Sequence Learning for COVID-19 Forecasting

no code implementations • NeurIPS 2020 • Sercan O. Arik, Chun-Liang Li, Jinsung Yoon, Rajarishi Sinha, Arkady Epshteyn, Long T. Le, Vikas Menon, Shashank Singh, Leyou Zhang, Nate Yoder, Martin Nikoltchev, Yash Sonthalia, Hootan Nakhost, Elli Kanal, Tomas Pfister

We propose a novel approach that integrates machine learning into compartmental disease modeling to predict the progression of COVID-19.

Paper
Add Code

Kernel Stein Generative Modeling

no code implementations • 6 Jul 2020 • Wei-Cheng Chang, Chun-Liang Li, Youssef Mroueh, Yiming Yang

NCK is crucial for successful inference with SVGD in high dimension, as it adapts the kernel to the noise level of the score estimate.

Bayesian Inference

Paper
Add Code

A Simple Semi-Supervised Learning Framework for Object Detection

7 code implementations • 10 May 2020 • Kihyuk Sohn, Zizhao Zhang, Chun-Liang Li, Han Zhang, Chen-Yu Lee, Tomas Pfister

Semi-supervised learning (SSL) has a potential to improve the predictive performance of machine learning models using unlabeled data.

Ranked #13 on Semi-Supervised Object Detection on COCO 100% labeled data (using extra training data)

Data Augmentation Image Classification +4

399

Paper
Code

Unsupervised Program Synthesis for Images By Sampling Without Replacement

no code implementations • 27 Jan 2020 • Chenghui Zhou, Chun-Liang Li, Barnabas Poczos

However, they struggle with the inherent sparsity of meaningful programs in the search space.

Program Synthesis Reinforcement Learning (RL)

Paper
Add Code

FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

27 code implementations • NeurIPS 2020 • Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel

Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance.

Ranked #3 on Semi-Supervised Image Classification on SVHN, 1000 labels

Pseudo Label Semi-Supervised Image Classification

1,061

Paper
Code

Learned Interpolation for 3D Generation

no code implementations • 8 Dec 2019 • Austin Dill, Songwei Ge, Eunsu Kang, Chun-Liang Li, Barnabas Poczos

The typical approach for incorporating this creative process is to interpolate in a learned latent space so as to avoid the problem of generating unrealistic instances by exploiting the model's learned structure.

3D Generation

Paper
Add Code

Getting Topology and Point Cloud Generation to Mesh

no code implementations • 8 Dec 2019 • Austin Dill, Chun-Liang Li, Songwei Ge, Eunsu Kang

In this work, we explore the idea that effective generative models for point clouds under the autoencoding framework must acknowledge the relationship between a continuous surface, a discretized mesh, and a set of points sampled from the surface.

Point Cloud Generation

Paper
Add Code

On Completeness-aware Concept-Based Explanations in Deep Neural Networks

2 code implementations • NeurIPS 2020 • Chih-Kuan Yeh, Been Kim, Sercan O. Arik, Chun-Liang Li, Tomas Pfister, Pradeep Ravikumar

Next, we propose a concept discovery method that aims to infer a complete set of concepts that are additionally encouraged to be interpretable, which addresses the limitations of existing methods on concept explanations.

Paper
Code

On Concept-Based Explanations in Deep Neural Networks

no code implementations • 25 Sep 2019 • Chih-Kuan Yeh, Been Kim, Sercan Arik, Chun-Liang Li, Pradeep Ravikumar, Tomas Pfister

Next, we propose a concept discovery method that considers two additional constraints to encourage the interpretability of the discovered concepts.

Paper
Add Code

Developing Creative AI to Generate Sculptural Objects

no code implementations • 20 Aug 2019 • Songwei Ge, Austin Dill, Eunsu Kang, Chun-Liang Li, Lingyao Zhang, Manzil Zaheer, Barnabas Poczos

We explore the intersection of human and machine creativity by generating sculptural objects through machine learning.

Clustering Generating 3D Point Clouds

Paper
Add Code

LBS Autoencoder: Self-supervised Fitting of Articulated Meshes to Point Clouds

no code implementations • CVPR 2019 • Chun-Liang Li, Tomas Simon, Jason Saragih, Barnabás Póczos, Yaser Sheikh

As input, we take a sequence of point clouds to be registered as well as an artist-rigged mesh, i. e. a template mesh equipped with a linear-blend skinning (LBS) deformation space parameterized by a skeleton hierarchy.

Paper
Add Code

Implicit Kernel Learning

no code implementations • 26 Feb 2019 • Chun-Liang Li, Wei-Cheng Chang, Youssef Mroueh, Yiming Yang, Barnabás Póczos

While learning the kernel in a data driven way has been investigated, in this paper we explore learning the spectral distribution of kernel via implicit generative models parametrized by deep neural networks.

Text Generation

Paper
Add Code

Kernel Change-point Detection with Auxiliary Deep Generative Models

2 code implementations • ICLR 2019 • Wei-Cheng Chang, Chun-Liang Li, Yiming Yang, Barnabás Póczos

Detecting the emergence of abrupt property changes in time series is a challenging problem.

Change Point Detection Time Series +1

Paper
Code

Hallucinating Point Cloud into 3D Sculptural Object

no code implementations • 13 Nov 2018 • Chun-Liang Li, Eunsu Kang, Songwei Ge, Lingyao Zhang, Austin Dill, Manzil Zaheer, Barnabas Poczos

Our approach extends DeepDream from images to 3D point clouds.

Object

Paper
Add Code

Point Cloud GAN

1 code implementation • 13 Oct 2018 • Chun-Liang Li, Manzil Zaheer, Yang Zhang, Barnabas Poczos, Ruslan Salakhutdinov

In this paper, we first show a straightforward extension of existing GAN algorithm is not applicable to point clouds, because the constraint required for discriminators is undefined for set data.

Object Recognition

Paper
Code

Beyond Pixel Norm-Balls: Parametric Adversaries using an Analytically Differentiable Renderer

no code implementations • ICLR 2019 • Hsueh-Ti Derek Liu, Michael Tao, Chun-Liang Li, Derek Nowrouzezahrai, Alec Jacobson

As such, we propose the direct perturbation of physical parameters that underly image formation: lighting and geometry.

Data Augmentation

Paper
Add Code

Nonparametric Density Estimation under Adversarial Losses

no code implementations • NeurIPS 2018 • Shashank Singh, Ananya Uppal, Boyue Li, Chun-Liang Li, Manzil Zaheer, Barnabás Póczos

We study minimax convergence rates of nonparametric density estimation under a large class of loss functions called "adversarial losses", which, besides classical $\mathcal{L}^p$ losses, includes maximum mean discrepancy (MMD), Wasserstein distance, and total variation distance.

Density Estimation

Paper
Add Code

Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond

2 code implementations • 5 Apr 2018 • Xi Ouyang, Yu Cheng, Yifan Jiang, Chun-Liang Li, Pan Zhou

The results show that our framework can smoothly synthesize pedestrians on background images of variations and different levels of details.

Ranked #2 on Scene Text Recognition on MSDA

Generative Adversarial Network Pedestrian Detection +1

323

Paper
Code

Sobolev GAN

2 code implementations • ICLR 2018 • Youssef Mroueh, Chun-Liang Li, Tom Sercu, Anant Raj, Yu Cheng

We show that the Sobolev IPM compares two distributions in high dimensions based on weighted conditional Cumulative Distribution Functions (CDF) of each coordinate on a leave one out basis.

Text Generation

Paper
Code

One Network to Solve Them All -- Solving Linear Inverse Problems Using Deep Projection Models

1 code implementation • ICCV 2017 • J. H. Rick Chang, Chun-Liang Li, Barnabas Poczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan

While deep learning methods have achieved state-of-the-art performance in many challenging inverse problems like image inpainting and super-resolution, they invariably involve problem-specific training of the networks.

Compressive Sensing Image Inpainting +1

Paper
Code

MMD GAN: Towards Deeper Understanding of Moment Matching Network

2 code implementations • NeurIPS 2017 • Chun-Liang Li, Wei-Cheng Chang, Yu Cheng, Yiming Yang, Barnabás Póczos

In this paper, we propose to improve both the model expressiveness of GMMN and its computational efficiency by introducing adversarial kernel learning techniques, as the replacement of a fixed Gaussian kernel in the original GMMN.

Computational Efficiency Generative Adversarial Network

190

Paper
Code

Data-driven Random Fourier Features using Stein Effect

no code implementations • 23 May 2017 • Wei-Cheng Chang, Chun-Liang Li, Yiming Yang, Barnabas Poczos

Large-scale kernel approximation is an important problem in machine learning research.

Paper
Add Code

One Network to Solve Them All --- Solving Linear Inverse Problems using Deep Projection Models

2 code implementations • 29 Mar 2017 • J. H. Rick Chang, Chun-Liang Li, Barnabas Poczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan

On the other hand, traditional methods using signal priors can be used in all linear inverse problems but often have worse performance on challenging tasks.

Compressive Sensing Image Inpainting +1

Paper
Code

CMU DeepLens: Deep Learning For Automatic Image-based Galaxy-Galaxy Strong Lens Finding

1 code implementation • 8 Mar 2017 • Francois Lanusse, Quanbin Ma, Nan Li, Thomas E. Collett, Chun-Liang Li, Siamak Ravanbakhsh, Rachel Mandelbaum, Barnabas Poczos

We find on our simulated data set that for a rejection rate of non-lenses of 99%, a completeness of 90% can be achieved for lenses with Einstein radii larger than 1. 4" and S/N larger than 20 on individual $g$-band LSST exposures.

Instrumentation and Methods for Astrophysics Cosmology and Nongalactic Astrophysics Astrophysics of Galaxies

Paper
Code

Annealing Gaussian into ReLU: a New Sampling Strategy for Leaky-ReLU RBM

no code implementations • 11 Nov 2016 • Chun-Liang Li, Siamak Ravanbakhsh, Barnabas Poczos

Due to numerical stability and quantifiability of the likelihood, RBM is commonly used with Bernoulli units.

Paper
Add Code

Rivalry of Two Families of Algorithms for Memory-Restricted Streaming PCA

no code implementations • 4 Jun 2015 • Chun-Liang Li, Hsuan-Tien Lin, Chi-Jen Lu

In this paper, we analyze the convergence rate of a representative algorithm with decayed learning rate (Oja and Karhunen, 1985) in the first family for the general $k>1$ case.

Vocal Bursts Valence Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.