
Legal Data Scientist, Lillian Goldman Law Library
Yale University - Central Campus
1. Extract huge volumes of data from multiple internal and external sources.
2. Conduct undirected research and frame open-ended industry questions.
3. Employ sophisticated analytics programs, machine learning and statistical methods to prepare data for use in predictive and prescriptive modeling.
4. Thoroughly clean and prune data to discard irrelevant information.
5. Explore and examine data from a variety of angles to determine hidden weaknesses, trends and/or opportunities.
6. Devise data-driven solutions to the most pressing challenges.
7. Invent new algorithms to solve problems and build new tools to automate work.
8. Communicate predictions and findings to management and IT departments through effective data visualizations and reports.
9. Utilize real-time data streams to generate predictive and prognostic analytical outputs.
Required Education and Experience
Bachelor’s degree in computer science, mathematics or a related subject and six years of experience, or an equivalent combination of education and experience.
Background Check Requirements
All candidates for employment will be subject to pre-employment background screening for this position, which may include motor vehicle, DOT certification, drug testing and credit checks based on the position description and job requirements. All offers are contingent upon the successful completion of the background check. For additional information on the background check requirements and process visit "Learn about background checks" under the Applicant Support Resources section of Careers on the It's Your Yale website.
Position Focus:
The Lillian Goldman Law Library at Yale Law School seeks an inquisitive and collaborative Legal Data Scientist to meet the growing demand for supporting social scientific research and inquiry at Yale Law School. This position provides project-based support and consultation services for Law School student and faculty researchers throughout the entire research lifecycle. The Legal Data Scientist will enhance local capacity to create, maintain, and promote research-ready data deliverables and help raise scientific standards for empirical legal research. The Legal Data Scientist will work closely with researchers, research support specialists, students, and librarians at the Law School and around Yale University to deliver responsive and accurate data support services that meet local research needs. As such, the position involves staying informed of developments in research data management, research design, data analysis, and computational social science. This role presents an exciting opportunity to contribute to the scholarly and pedagogical mission of Yale Law School, empowering students and faculty to produce compelling, transformative, and innovative research at a high scientific standard.
Yale Law School Data Services supports law faculty, students, and staff in producing transparent research, transforming data into information, and sharing reusable data. The Lillian Goldman Law Library is a strategic asset to Yale Law School and Yale University, advancing intellectual discovery, information literacy, and lifelong learning, all in support of the University’s strategic goals in scholarship, education, preservation, and practice. The Legal Data Scientist works collaboratively with the Law Archive Co-administrators and their technology partner, the Center for Open Science, in the management of the Law Archive open access legal scholarship repository. The Legal Data Scientist also collaborates with other Yale Library and University data interest groups.
Responsibilities:
1. Generate analysis-ready datasets from multiple input sources and data types, including websites, electronic documents, social media posts, APIs, and tabular datasets.
2. Develop and maintain APIs and other programming functions to automate the collection and management of frequently used research data sources.
3. Perform descriptive analyses and interpret results using a combination of tables, graphics, and text for both non-technical and expert audiences.
4. Leverage web-based data applications and visualizations to promote broad discoverability and engagement with ongoing research projects and programs across the Yale Law School research community.
5. Develop and organize introductory and advanced workshops, courses, and other educational programming on topics related to empirical legal research and social science research methods.
6. Supervise and direct the performance of student research assistants and/or consultants.
7. Engage in research, scholarly publication, and professional service activities in areas related to core responsibilities.
8. Complete other administrative and research duties as assigned.
Preferred Education, Experience and Skills:
Advanced degree (M.A./M.S. or Ph.D.) in computer/data science or applied social science. Familiarity with social scientific research methods. Experience with legal information, the American legal system, Stata, econometric research, and sensitive data. Skill in developing and delivering curriculum. Proficiency with virtual machines and cloud-based environments (e.g., Azure, AWS).
Posting Disclaimer
The intent of this job description is to provide a representative summary of the essential functions that will be required of the position and should not be construed as a declaration of specific duties and responsibilities of the particular position. Employees will be assigned specific job-related duties through their hiring departments.
The University is committed to basing judgments concerning the admission, education, and employment of individuals upon their qualifications and abilities and seeks to attract to its faculty, staff, and student body qualified persons from a broad range of backgrounds and perspectives. In accordance with this policy and as delineated by federal and Connecticut law, Yale does not discriminate in admissions, educational programs, or employment against any individual on account of that individual’s sex, sexual orientation, gender identity or expression, race, color, national or ethnic origin, religion, age, disability, status as a special disabled veteran, veteran of the Vietnam era or other covered veteran.
Inquiries concerning Yale’s Policy Against Discrimination and Harassment may be referred to the Office of Institutional Equity and Accessibility (OIEA).
Required Skill/Ability 1:
Applied experience working in data science, computer engineering, or related profession(s). Strong technical skills and problem-solving abilities related to research data management, database design, and data analysis.
Required Skill/Ability 2:
Familiarity with identifying the needs of researchers and providing individualized services and guidance that meet those needs.
Required Skill/Ability 3:
Project-based experience with parsing, cleaning, and extracting entities from text-based data using a combination of natural language processing methods, machine-learning algorithms, and/or large language models.
Required Skill/Ability 4:
Proficiency in R and/or Python programming languages and SQL. Working knowledge of HTML, JavaScript, and web design to facilitate entity extraction from web-based sources and the creation of data-based web applications.
Required Skill/Ability 5:
Experience with version control software (e.g., Git and GitHub). Demonstrated interpersonal and teamwork skills complemented by the ability to take initiative. Excellent oral and written communication skills.
Health Requirements
Certain positions have associated health requirements based on specific job responsibilities. These may include vaccinations, tests, or examinations, as required by law, regulation, or university policy.