Publications

Data Curation

2018
  • Thomer, A., Wickett, K., Baker, K. S., Fouke, B., & Palmer, C. L. (2018). Documenting provenance in non-computational workflows: Development of Research Process Models through a case study of geobiology research in Yellowstone National Park. In Journal of the Association for Information Science & Technology.

  • Weber, N., & Yan, A. (2018). Mining Open Government Data Used in Scientific Research. In iConference 2018 .

  • Yan, A., & Weber, N. (2018). Mining Open Government Data Used in Scientific Research. In International Conference on Information (iConference 2018). Springer.

2017
  • Weber, N. (2017). Contextual Integrity for Qualitative Data. In Digital Humanities 2017.

  • Grechkin, M., Poon, H., & Howe, W. G. (2017). EZLearn: Exploiting Organic Supervision in Large-Scale Data Annotation. In Learning with Limited Labeled Data: Weak Supervision and Beyond, 2017 NIPS Conference.

  • Stoyanovich, J., Howe, W. G., Abiteboul, S., Miklau, G., Sahuguet, A., & Weikum, G. (2017). Fides: Towards a Platform for Responsible Data Science. In ACM Conference on Scientific and Statistical Database Management (SSDBM).

  • Weber, N. (2017). Integrating User Feedback with Open Data Quality Models . In ASIS&T Annual Meeting 2017.

  • Hutchison, D., Howe, W. G., & Suciu, D. (2017). LaraDB: A Minimalist Kernel for Linear and Relational Algebra Computation. In BeyondMR workshop, 2017 ACM SIGMOD conference.

  • Weber, N. (2017). Orchestrating Cloud Infrastructure to Manage Sensitive Data . In International Digital Curation Conference.

  • Palmer, C. L., Thomer, A., & Fouke, B. (2017). Site-based data curation based on global hot-spring geobiology. In PLOS ONE, 12 (2) , e0172090.

  • Weber, N. (2017). Sticky and Leaky Abstractions for Data on the Web. In 11th U.S. Networked Knowledge Organization Systems (NKOS) Workshop.

  • Grechkin, M., Poon, H., & Howe, W. G. (2017). Wide-Open: Accelerating public data release by automating detection of overdue datasets. In PLOS Biology, 15 (6) ,

2016
  • Jain, S., & Howe, W. G. (2016). Data Cleaning in the Wild: Reusable Curation Idioms from a Multi-Year SQL Workload. In Proceedings of the 11th International Workshop on Quality in Databases (QDB'16).

  • Hutchison, D., Kepner, J., Gadepally, V., & Howe, W. G. (2016). From NoSQL Accumulo to NewSQL Graphulo: Design and utility of graph algorithms inside a BigTable database. In Proceedings of the High Performance Extreme Computing Conference (HPEC 2016), 1-9.

  • Jain, S., Moritz, D., & Howe, W. G. (2016). High Variety Cloud Databases. In Proceedings of the 2016 IEEE Cloud Data Management Workshop.

  • Hyrkas, J., & Howe, W. G. (2016). MusicDB: Relational Approach for Numeric Longitudinal Music Analytics. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR 2016), 702-708.

  • Jain, S., Moritz, D., Howe, W. G., & Lazowska, E. (2016). SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment. In SIGMOD '16: Proceedings of the 2016 International Conference on Management of Data, 281-293.

  • Hyrkas, J., Clayton, S., Ribalet, F., Halperin, D., Armbrust, E. V., & Howe, W. G. (2016). Scalable clustering algorithms for continuous environmental flow cytometry. In Bioinformatics, 32 (3) , 417–423.

  • Lee, P., West, J., & Howe, W. G. (2016). VizioMetrix: A Platform for Analyzing the Visual Information in Big Scholarly Data. In BigScholar Workshop (Third WWW Workshop on Big Scholarly Data: Towards the Web of Scholars).

  • Wongsuphasawat, K., Moritz, D., Anand, A., Mackinlay, J., Howe, W. G., & Heer, J. (2016). Voyager: Exploratory analysis via faceted browsing of visualization recommendations. In IEEE Transactions on Visualization and Computer Graphics, 22 (1) , 649–658. IEEE.

2015
  • Elmore, A., Duggan, J., Stonebraker, M., Balazinska, M., Cetintemel, U., Gadepally, V., Heer, J., Howe, W. G., Kepner, J., Kraska, T., Madden, S., Maier, D., Mattson, T., Papadopoulos, S., Parkhurst, J., Tatbul, N., Vartak, M., & Zdonik, S. (2015). A Demonstration of the BigDAWG Polystore System. In Proc. Very Large Database Endowment (PVLDB), 8 (12) ,

  • Chao, T. C., Cragin, M. H., & Palmer, C. L. (2015). Data Practices and Curation Vocabulary (DPCVocab): An Empirically Derived Framework of Scientific Data Practices and Curatorial Processes. In Journal of the Association for Information Science & Technology, 66 (3) , 616-633.

  • Mayernik, M. S., Thompson, C. A., Williams, V., Allard, S., Palmer, C. L., & Tenopir, C. (2015). Enriching Education with Exemplars in Practice: Iterative Development of Data Curation Internships. In International Journal of Digital Curation, 10 (1) , 123-134.

  • Thompson, C. A., Mayernik, M. S., Palmer, C. L., Allard, S., & Tenopir, C. (2015). LIS Programs and Data Centers: Integrating Expertise. In iConference 2015 Proceedings.

2014
2013
  • Thomer, A. K., Palmer, C. L., Wickett, K. M., Baker, K. S., Jett, J. G., DiLauro, T., Fouke, B. W., , Asangba, A. E., Rodman, A., & Choudhury, G. S. (2013). Data Curation for Geobiology at Yellowstone National Park. In CIRSS Technical Report, SBDC1301,

2011
  • Palmer, C. L., Weber, N. M., & Cragin, M. H. (2011). The analytic potential of scientific data: Understanding re-use potential. In Proceedings of the American Society for Information Science & Technology, 48 (1) ,

Unknown Year
  • Weber, N., & Brown, J. Remediating Civic Tech. In iConfrence 2018.