
THE FUTURE OF LEGAL ARTIFICIAL INTELLIGENCE (A.I.) – A DISCOURSE ON A.I. COMPONENTS, LEVELS, AND BIASES


This article was originally posted by the American Bar Association. Reposted with permission.

June 2021

By Michael Andrew Iseri
Attorney & Software Engineer

A.   Introduction

The legal field, as with other professions, is undergoing a transformative phase that will integrate more advanced technology into its legal services. Technology adoption rates accelerated in 2020 due to Covid-19 restrictions on in-person meetings and legal hearings. As with inadmissible evidence at trial, the door is now wide open to bringing in technology.

This article serves as a primer on the current state of a.i.s and their application to the legal field. To note, there are few resources that clearly define legal technologies, especially legal a.i.s, without misleading marketing terms, grandiose claims and gimmicks, or applications that do not hold up in the real world. Most importantly, different classes and use cases of a.i.s exist in the real world, such as search engine a.i.s, content generation a.i.s, navigation a.i.s (such as self-driving cars), auto-response a.i.s, and more. The knowledge in this article is based on the author’s unique perspective as an attorney and software engineer.[1] The information has also been vetted through numerous dialogues with software engineers from Google and Uber in San Francisco and Silicon Valley.[2]

To begin, this article provides an overview of the three components of an a.i. to better equip the reader to understand and characterize the different a.i. programs out in the real world. Second, there is a brief discussion of how the three a.i. components create the different levels of a.i. complexity in the real world. Lastly, this article provides an overview of the different types of bias that could “corrupt” an a.i. program during its development and implementation stages and that would likely affect any development of a legal a.i.

B.   The Three A.I. Components – (1) Human Interfaces, (2) Intelligent Automation (IA), and (3) Machine Learning (ML)

In the programming world, there is no such thing as a “true a.i.,” a program that codes itself to evolve and adapt. Hollywood and film have tainted the populace’s perception of what a.i.s truly are across numerous professions.

There are three main components that can characterize an a.i. program in different professions. They are the following:

(1) Human Interface: This component is the main means by which an a.i. program communicates with humans, whether through sight, sound, touch (haptic feedback), or other means. Without this component, a program would not be able to receive input from humans or report back to them on its operations. Main examples are dialogue boxes and webpages, chatbots, voice interfaces such as Amazon’s Alexa and Apple’s Siri, vibrations, sirens and alarms, and other means.

(2) Intelligent Automation Tools (IA): This component essentially defines an a.i. program’s identity and core functions by establishing its tools and operations. IA tools are coded instructions that provide the necessary means for an a.i. program to do what it is programmed to do. They are analogous to the tools a human would use to accomplish a particular goal, such as using a saw and hammer to build a table. Most importantly, these IA tools have been programmed by humans, and no a.i. program has been able to truly build its own IA tools outside of a controlled environment. Currently, IA tools are the limiting factor for a.i. programs to evolve since they require humans to program new parameters and functions. For example, a human can easily play chess or Go on a board with more rows and columns than a conventional gameboard; however, an a.i. program would not be able to understand these outside rules unless that possibility already falls within its existing IA tool parameters. (A minimal code sketch of this limitation appears after this list.)

(3) Machine Learning (ML): This component allows an a.i. program to adjust its own parameters to optimize itself or to find alternate solutions. The main benefit of ML is that it allows finer optimization at far greater development speed by removing the programmer’s need to continuously fine-tune a program. For example, an a.i. program that identifies a particular fruit in images can do so without a programmer needing to continuously refine the parameters for that fruit at different angles and in different lighting. This is often accomplished by “feeding” the a.i. program libraries of existing images/information, such as giving an image a.i. program numerous different pictures of a single fruit so the program can refine its parameters. Additionally, the better ML tools incorporate multiple layers of checks that conduct different analyses and judgment protocols on a particular task before coming to a consensus and a conclusion. Imagine this as numerous panels of appellate judges at different levels, with different backgrounds, trying to decide the outcome of a single case. To note, some people have misconstrued ML tools as an a.i. itself since they appear to perform the other two a.i. components (human interface and IA tools). This is a misconception, since ML merely optimizes and refines the other two a.i. components. At present, ML cannot truly create its own IA tools outside controlled environments. An example would be a user asking a voice app for the outside temperature in the morning. The human interface receives the request and replies via voice, while the IA tools check an online database for the outside temperature. If built into the program, the ML tools would provide a more customized human interface response to the user (such as using the user’s name and making the response shorter) and could already run the IA tools for temperatures based on the user’s likely location in anticipation of that request. (A second sketch after this list illustrates the parameter-adjustment idea.)
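
To make the IA-tools idea concrete, here is a minimal Python sketch, entirely hypothetical and not from the original article, of a chess “move tool” whose board size is a hardcoded parameter. The program can only reason about the board it was programmed to expect.

# Minimal, hypothetical sketch of an IA tool with programmer-fixed parameters.
# The board size is hardcoded: the program cannot reason about a larger board
# unless a human rewrites this parameter.

BOARD_SIZE = 8  # programmer-defined; the program cannot change it on its own

def is_on_board(row: int, col: int) -> bool:
    """IA tool: check whether a square exists on the board the program knows about."""
    return 0 <= row < BOARD_SIZE and 0 <= col < BOARD_SIZE

def rook_moves(row: int, col: int) -> list[tuple[int, int]]:
    """IA tool: list every square a rook could slide to on the programmed board."""
    moves = [(r, col) for r in range(BOARD_SIZE) if r != row]
    moves += [(row, c) for c in range(BOARD_SIZE) if c != col]
    return moves

# A human can improvise on a 10x10 board; this program cannot.
print(is_on_board(9, 9))      # False -- the square "does not exist" to the program
print(len(rook_moves(0, 0)))  # 14 legal rook moves on the hardcoded 8x8 board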
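
The ML component can be illustrated with a similarly minimal, hypothetical sketch: instead of a programmer hand-tuning the cutoff for what counts as a particular fruit, the program sweeps candidate values against a small library of labeled examples (invented here for illustration) and keeps whichever value classifies them best.

# Minimal, hypothetical sketch of the ML idea: the program adjusts its own
# parameter from labeled examples instead of a programmer hand-tuning it.

# (redness_score, is_apple) pairs -- the "library" of examples fed to the program
examples = [(0.9, True), (0.8, True), (0.75, True),
            (0.4, False), (0.3, False), (0.2, False)]

def accuracy(threshold: float) -> float:
    """Fraction of examples labeled correctly if anything above the threshold counts as an apple."""
    correct = sum(1 for score, is_apple in examples if (score > threshold) == is_apple)
    return correct / len(examples)

# "Learning": sweep candidate thresholds and keep whichever classifies the examples best.
candidates = [i / 100 for i in range(101)]
best_threshold = max(candidates, key=accuracy)

print(best_threshold, accuracy(best_threshold))  # the program has found its own cutoff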

C.   A Brief Overview of the Levels of an A.I. – “Simple A.I.s,” “Sophisticated A.I.s,” and “True A.I.s.”

When only two or three a.i. components are present, you have a “simple a.i.” Contrary to its name, a “simple a.i.” can be extremely complex and is often extremely efficient at accomplishing what it needs to accomplish. From the author’s experience, only the human interface and IA tools are necessary components for a.i. programs to operate in professions, especially in the legal profession. Although there are numerous definitions of a.i.s you can look up, often the mere appearance of a program performing a complex or repetitive task quickly demonstrates “intelligence.” The ML component is not necessary when an a.i. program does not need to refine itself after numerous uses or when the a.i. program is easily adjustable by a programmer (which is a separate topic on technology sustainability and deprecation). Examples of “simple a.i.s” in the legal profession would be basic document automation programs/websites and e-discovery search tools that try to find patterns based on inputted search terms.
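
As a rough, hypothetical illustration of how little a “simple a.i.” needs, the sketch below pairs a plain-text human interface with a single IA tool that searches a handful of invented documents for inputted terms, in the spirit of a bare-bones e-discovery search.

# Minimal, hypothetical sketch of a "simple a.i.": a human interface (plain text
# in/out) plus an IA tool (keyword matching over documents). The documents and
# search terms are invented; a real tool would read from a document database.

documents = {
    "email_001.txt": "Please review the merger agreement before Friday.",
    "email_002.txt": "Lunch on Thursday? Let me know.",
    "memo_003.txt":  "The merger closing date has moved; review the revised agreement.",
}

def search(terms: list[str]) -> dict[str, list[str]]:
    """IA tool: return, for each document, which of the inputted search terms appear in it."""
    hits = {}
    for name, text in documents.items():
        matched = [t for t in terms if t.lower() in text.lower()]
        if matched:
            hits[name] = matched
    return hits

# Human interface: a plain-text report of the results.
for name, matched in search(["merger", "agreement"]).items():
    print(f"{name}: matched {', '.join(matched)}")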

A “sophisticated a.i.” is when you have multiple a.i. components or numerous “simple a.i.s” working in conjunction to accomplish numerous tasks. The main difference between a “simple a.i.” and a “sophisticated a.i.” is not complexity (though that can be a factor), but rather the sheer number of separate a.i. components working alongside each other to accomplish an a.i. program’s task.

A “true a.i.” is when you have a program that creates its own IA tools without any human programming it to learn those IA tools. To the best of the author’s knowledge, no “true a.i.s” exist outside of controlled environments in which humans already guided the program’s development. For example, you and I could try to play a new musical instrument tomorrow on a whim; an a.i. program would not have that ability unless learning a new musical instrument were already within its existing IA tool parameters. A program cannot “try” unless preprogrammed to do so. Until an a.i. program can learn a new skill of its own accord, without any human intervention or guidance, a “true a.i.” remains a tale best told in cinema.

D.  An Overview of the Program’s Bias Problem – (1) Programmer’s Bias, (2) Data Bias, and (3) Application Bias

Now that you know the components and levels of a.i. programs, consider why creating a.i. programs for the legal field could be problematic: the “Program’s Bias Problem.” Programs have multiple stages of development at which biases can be introduced and affect an a.i. program. In current diversity and inclusion research, the Program’s Bias Problem is akin to “implicit bias” (the belief that an unconscious bias affecting value judgments is manifested in every individual). It is the author’s belief that, in the programming world, biases show up at three stages of a program’s life cycle: (1) programmer’s bias, (2) data bias, and (3) application bias.

The first stage of bias is introduced at the development and programming level of a program. A company’s development committee and its programmers must make binary cutoffs throughout various parts of a program’s code for the code to function. Often, in programming, the program’s responses and observations are reflected as binary inputs/outputs of strictly 1’s or 0’s (e.g., on/off; yes/no; white/black). Even when graded decision-making is implemented (e.g., shades of gray), there are cutoff points or thresholds at the code level, such as a 50/50 cutoff. These cutoff points are often set by the development committee and/or programmers, either implicitly or through machine learning (ML) algorithms that adjust the thresholds up and down, and they reflect the biases of the development committee and/or programmers in pursuing their desired goals for an a.i. program.
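
A hypothetical sketch shows how much rides on a single programmer-chosen number; the scores and the 0.5 cutoff below are invented for illustration.

# Minimal, hypothetical sketch of programmer's bias via a cutoff: a human (or an
# ML routine) chose 0.5, and that single number decides which side of the line
# every borderline case falls on.

APPROVAL_CUTOFF = 0.5  # chosen by the development committee/programmers

def decide(score: float) -> str:
    """Turn a graded score (shades of gray) into a binary yes/no outcome."""
    return "approve" if score >= APPROVAL_CUTOFF else "deny"

for score in (0.49, 0.50, 0.51):
    print(score, "->", decide(score))  # 0.49 denied, 0.50 approved: the threshold is where the bias lives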

The second stage is data bias. For programs to function correctly with their intelligent automation (IA) tools and machine learning, they must be “fed” vast amounts of data. The source of the data can be biased, and biased data will make the program biased. An example is a college implementing an admissions a.i. program intended to accept the best of the best candidates for the next school year.[3] If the data comes from the college’s hundred-plus-year history, especially from the pre-Civil Rights era, then the program would likely incorporate racist biases into its application processes.
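
A hypothetical sketch makes the point: a naive “admissions model” that simply learns historical acceptance rates will reproduce whatever skew exists in its history. The records below are invented for illustration.

# Minimal, hypothetical sketch of data bias: the program "learns" historical
# admission rates per group and reuses them as predictions, so any skew in the
# history becomes the new policy.

# (group, was_admitted) pairs drawn from an invented, skewed historical record
history = [("A", True)] * 90 + [("A", False)] * 10 + [("B", True)] * 20 + [("B", False)] * 80

def learned_rate(group: str) -> float:
    """Learn the historical admission rate for a group and reuse it as a prediction."""
    outcomes = [admitted for g, admitted in history if g == group]
    return sum(outcomes) / len(outcomes)

print(learned_rate("A"))  # 0.9 -- the old skew becomes the new policy
print(learned_rate("B"))  # 0.2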

The third stage is application bias. This stage concerns how the program is used in the real world and how that use can affect the overall biases from the other two stages. The best way to describe this bias is through an example. Imagine that you have the best program to detect drug usage. Amazing, right? However, what if that drug-detection program is used only at traffic stops and only on specific groups, such as Hispanics and African Americans? The use of the program in and of itself creates a bias (being used only at traffic stops and only on selected groups), and this bias feeds an unintended consequential loop that affects programmer’s bias and data bias.

The three stages of bias have real-world consequences. A popular story shared amongst diversity and inclusion advisers discusses image-search a.i. programs struggling to distinguish images of chihuahuas’ faces from blueberry muffins.[4] A real-world controversy involved a popular Google image app that, in 2015, tagged images of an African American couple with the photo tag “Gorilla.”[5] After three years, Google “fixed” the problem by simply removing the label “Gorilla” so that no images would be labeled “Gorilla.”[6] This is a prime example of a lack of oversight in developing the a.i. program’s image recognition, of not testing enough varied data during the program’s development, and of not testing the program in its application before deployment. The result was an a.i. image program with underlying racist problems stemming from program’s bias.

Another example involves the recent firing/resignation of a prominent African American Google A.I. scholar, Timnit Gebru, over a soon-to-be-published paper on the risks of large-scale human-language-processing a.i. programs.[7] She warned that such large-scale a.i.s could let the larger, more text-vocal majority drown out smaller, more nuanced diction and linguistic cultural developments.[8] MIT Technology Review describes one of the major conclusions of the paper as follows:

It [large-scale language processing a.i. programs] will also fail to capture the language and the norms of countries and peoples that have less access to the internet and thus a smaller linguistic footprint online. The result is that AI-generated language will be homogenized, reflecting the practices of the richest countries and communities.

She also highlights other problems, such as the massive energy costs of training such an a.i. program (its carbon footprint and electricity use), the fact that the a.i. program would not actually understand human language but would instead manipulate data to appear as though it does, and the potential, if successful, for the a.i. program to generate misinformation through an illusion of meaning.[9] Gebru’s paper and warning further highlight the importance of overseeing the development of any a.i. program from its inception with diverse perspectives, of fully understanding the problems with existing and future data sources, and of examining the outcomes the a.i. program could produce and the biases it could further perpetuate in its application.

Conclusion

Considering that legal a.i.s would require sources that are often not the best sources of diversity and perspective, the future of legal a.i.s appears bleak with respect to unbiased development and application. One of the biggest problems for future legal a.i. programs stems from numerous statistical findings on gender, race, LGBT+, and disability representation at law firms and courts. For example, attorneys with disabilities made up 0.54% of law firm attorneys in 2017, while the U.S. Census Bureau reported that roughly 20% of the general population had a disability in 2010.[10] These problems will only become more prevalent as the legal profession embraces technology in the post-Covid world.

E.   Overview of the Author – Michael Andrew Iseri

Michael Andrew Iseri is an attorney, software engineer, cybersecurity professional, and a legal leader on accessible technology and diversity/inclusion matters. He creates the California Bar Exam through a four-year appointment from the Supreme Court of California to the California Committee of Bar Examiners (CBE); he is the Chair of Diversity for all California attorneys eight years and under through the California Young Lawyers Association (CYLA); and he is in the ABA Litigation Section’s Young Lawyers Leader Program (YLLP). He also mentors children and teenagers with disabilities through the Dyspraxia USA Foundation.

Google officially classified Michael Iseri as a “Legal Programming Unicorn,” one of the highest forms of recognition in the technology field, and brought him to Mountain View. He develops numerous legal a.i. programs, fully voiced in 45+ languages, that dynamically complete any legal procedure without needing any internet connection. He also offers fully voiced homeless/natural disaster/Covid-19 resource map systems for California, a fully voiced Constitution app, a fully voiced Miranda Rights app in 12 languages, and other free programs on the Google Play Store under his solo-owned company “LAWPP.” He is currently a finalist in the Global Legal Hackathon 2020 competition as a solo-member team.

He is also involved with cybersecurity matters. He published his thesis on the international legalities of Russia’s cyberattacks and cyber warfare initiatives in 2009 and graduated summa cum laude from the Legal Studies Program at UC Berkeley. He is CompTIA A+, Network+, and Security+ certified, with an exam score of 874/900. He has presented twice on legal cybersecurity for law firms at the California Lawyers Association’s Annual Meeting.

Lastly, his life story has been published by the ABA,[11] UCI Alumni Magazine (in physical and digital form),[12] and the Dyspraxia USA Foundation,[13] covering his life from being born with a severe disability that prevented him from talking when he was young to surviving a high school shooting in which he was shot in the back of the head.


[1] For a quick overview, the author has personally developed numerous legal “a.i.s” from scratch in 45+ fully voiced languages that dynamically complete legal services, as well as fully voiced homeless/Covid-19 resource map systems for California, fully voiced Constitution and Miranda Rights programs, and legal guides on the Google Play Store. The author also built automated, fully voiced Bar Exam flash card study programs, but they were decommissioned when the California Supreme Court appointed the author to the California Committee of Bar Examiners to create the July/February California Bar Exams for a four-year term.

[2] In the technology and programming fields, the best sources of information are local meetups, since technology moves extremely fast (faster than writing and publishing articles at times). It was easier to share and disseminate new technology concepts and best practices at pre-Covid, in-person meetups.

[3] “Untold History of AI: Algorithmic Bias Was Born in the 1980s,” https://spectrum.ieee.org/tech-talk/tech-history/dawn-of-electronics/untold-history-of-ai-the-birth-of-machine-bias.

[4] “Chihuahua or muffin? My search for the best computer vision API,” https://www.freecodecamp.org/news/chihuahua-or-muffin-my-search-for-the-best-computer-vision-api-cbda4d6b425d/

[5] “Google engineer apologizes after Photos app tags two black people as gorillas,” https://www.theverge.com/2015/7/1/8880363/google-apologizes-photos-app-tags-two-black-people-gorillas

[6] “Google ‘fixed’ its racist algorithm by removing gorillas from its image-labeling tech,” https://www.theverge.com/2018/1/12/16882408/google-racist-gorillas-photo-recognition-algorithm-ai

[7] “Google AI researcher’s exit sparks ethics, bias concerns” [12/03/2020], https://apnews.com/article/business-apple-inc-artificial-intelligence-00c1dab0a727456df9e5ef9c6160c792

[8] MIT Technology Review, “We read the paper that forced Timnit Gebru out of Google. Here’s what it says,” https://www.technologyreview.com/2020/12/04/1013294/google-ai-ethics-research-paper-forced-out-timnit-gebru/

[9] MIT Technology Review, “We read the paper that forced Timnit Gebru out of Google. Here’s what it says,” https://www.technologyreview.com/2020/12/04/1013294/google-ai-ethics-research-paper-forced-out-timnit-gebru/

[10] “Are law firms committed to disability diversity? A handful of firms have taken action,” https://www.abajournal.com/news/article/law_firms_disability_diversity

[11] https://www.americanbar.org/groups/litigation/committees/jiop/articles/2016/jiop-alumni-spotlight-michael-iseri-13/

[12] https://communications.uci.edu/magazine/2017/winter/taking-legal-power-to-the-people.php

[13] https://www.facebook.com/DyspraxiaUSA/posts/10151063115904597

©2021 by the American Bar Association. Reprinted with permission. All rights reserved. This information or any portion thereof may not be copied or disseminated in any form or by any means or stored in an electronic database or retrieval system without the express written consent of the American Bar Association.

