Workshop on Dataset Curation and Security
Nathalie Baracaldo · Yonatan Bisk · Avrim Blum · Michael Curry · John Dickerson · Micah Goldblum · Tom Goldstein · Bo Li · Avi Schwarzschild
Fri 11 Dec, 6 a.m. PST
Classical machine learning research has focused largely on models, optimizers, and computational challenges. As technical progress and hardware advancements ease these challenges, practitioners are now finding that the limitations and faults of their models are the result of their datasets. This is particularly true of deep networks, which often rely on huge datasets that are too large and unwieldy for domain experts to curate by hand. This workshop addresses issues in the following areas: data harvesting, dealing with the challenges and opportunities involved in creating and labeling massive datasets; data security, dealing with protecting datasets against risks of poisoning and backdoor attacks; policy, security, and privacy, dealing with the social, ethical, and regulatory issues involved in collecting large datasets, especially with regard to privacy; and data bias, related to the potential of biased datasets to result in biased models that harm members of certain groups. Dates and details can be found at securedata.lol.
Schedule
Fri 6:00 a.m. - 6:30 a.m. | Dawn Song (topic TBD) (Invited talk) | Dawn Song
Fri 6:30 a.m. - 7:00 a.m. | What Do Our Models Learn? (Invited talk) | Aleksander Madry
Fri 7:00 a.m. - 7:15 a.m. | Discussion (Discussion panel)
Fri 7:15 a.m. - 7:30 a.m. | Break
Fri 7:30 a.m. - 8:00 a.m. | Darrell West (TBD) (Invited talk) | Darrell West
Fri 8:00 a.m. - 8:30 a.m. | Adversarial, Socially Aware, and Commonsensical Data (Invited talk) | Yejin Choi
Fri 8:30 a.m. - 8:45 a.m. | Discussion panel (Discussion)
Fri 8:45 a.m. - 10:00 a.m. | Lunch Break
Fri 10:00 a.m. - 10:30 a.m. | Dataset Curation via Active Learning (Invited talk) | Robert Nowak
Fri 10:30 a.m. - 11:00 a.m. | Don't Steal Data (Invited talk) | Liz O'Sullivan
Fri 11:30 a.m. - 1:00 p.m. | Poster Session