Data plays a prominent role in our lives today, even if we aren't aware of its presence. There are many complex moral, legal, and philosophical questions about how we gather and use data, not least who actually owns it. After all, if I tell you how tall I am, do you now own my height? If you write the number down and sell the information on to someone else, am I entitled to a cut?
A case between data aggregator HiQ and social media platform LinkedIn highlights some of the difficult questions facing data scientists today.
The implications of the litigation between LinkedIn and HiQ are profound, but the case itself is simple enough to understand. It centered on LinkedIn's invocation of the Computer Fraud and Abuse Act in a cease-and-desist letter to HiQ.
HiQ is a data analytics firm that provides business intelligence based on publicly available data scraped from LinkedIn. Like many businesses today, it depends on access to public-facing data to be able to function. One of the unspoken but very salient questions raised by the case is where the line between public and private data lies.
The data that LinkedIn holds belongs to the company, inasmuch as it is being stored on their systems. However, the data itself consists only of what other people have submitted to LinkedIn. At the time of the case, the data was accessible to anyone who visited LinkedIn. From HiQ's perspective, this meant that the data on LinkedIn was fair game for scraping. From LinkedIn's perspective, their terms of service (ToS) prohibited the use of automation tools, and they had a right to enforce those ToS by banning IP addresses associated with scraping.
With a growing number of entities scraping LinkedIn for data, the platform took action to terminate the accounts of suspected offenders. One of the businesses caught up in the bans was HiQ. It was able to circumvent the IP ban easily by using proxy services to mask the IP addresses it used for scraping.
LinkedIn responded by sending a cease-and-desist letter to HiQ. They asserted that not only had the firm breached LinkedIn's ToS, but they had also violated the Computer Fraud and Abuse Act (CFAA), along with some other laws. HiQ responded with a lawsuit seeking an injunction against LinkedIn to prevent them from hindering HiQ's access to data until the case was resolved.
In an opinion published in September 2019, the Ninth Circuit, while stopping short of issuing a definitive ruling, appeared to be leaning towards HiQ's side. The Ninth Circuit made the significant decision to disregard some of its own prior rulings. This case was far from the first concerning how online services use the CFAA to enforce their own terms of service.
Giving the CFAA a broader focus so that it could be used to enforce a website's user agreement would have had a chilling effect on the then-nascent data scraping industry. In fact, the potential impact on internet users would have been far-reaching. Just about any internet user could be criminally liable for even minor infractions of a social media service's ToS. The Ninth Circuit's earlier ruling in United States v. Nosal suggested that its interpretation of the CFAA was relatively narrow and that violations of the Act required more than a ToS violation.
However, two other decisions taken by the Ninth Circuit muddied the waters. One of these was a second decision in the Nosal case. The other was a ruling in an unrelated case, Facebook v. Power Ventures. In the second Nosal ruling, the court held that the term "without authorization" in the CFAA is not limited to circumventing access control using technical methods. A user gaining unauthorized access with legitimate login credentials could still be in violation of the Act.
In the Power Ventures ruling, the court found that although the data scraper initially had permission to access Facebook accounts using passwords and scrape data, it continued to do so after Facebook issued a cease-and-desist letter. This put Power Ventures in violation of the CFAA. Facebook had also blocked the IP address Power Ventures had initially used, although Power Ventures' circumvention of this block was not in itself considered to be a violation.
A number of organizations, including the Electronic Frontier Foundation (EFF), took a particular interest in the case because it has far-reaching implications for data scraping. The case also presented an opportunity to overturn or limit the impact of the Ninth Circuit's earlier rulings, which the EFF feared would have a chilling effect on innovation and web scraping.
In their cease-and-desist to HiQ, LinkedIn cited the Power Ventures case as evidence that continuing to access its data would mean HiQ was in violation of the CFAA. HiQ decided to beat LinkedIn to the punch and filed for a preliminary injunction. Despite the earlier Power Ventures ruling, the Ninth Circuit found that HiQ was likely to be successful in their claim that automated access to public-facing data was not a violation of the CFAA.
The Ninth Circuit ultimately upheld the preliminary injunction, but there is still potential for the case to come back to court.
During the case, the EFF filed an amicus brief that emphasized to the court how vital scraping is to a number of industries. Web scraping isn't just used commercially. It is vital for research and has a number of other beneficial uses.
The Ninth Circuit affirmed that any data that required no authorization to access and was freely available by default was fair game for scraping. As the court pointed out, authorization to access data is implicit unless steps are taken to restrict general access.
The ruling in HiQ v. LinkedIn means that judges in the future will have more leeway. It limits the significance of earlier rulings in the Power Ventures and Nosal cases. In those cases, the court was of the opinion that requiring a login before providing access to data would render it as private, not public, data.
This raises another problem, however. Once a user logs in to Facebook, a wealth of otherwise private data becomes easily available without restrictions. LinkedIn appears to have interpreted the court's ruling as meaning that any and all data that requires a login is private and that LinkedIn can revoke access to it. As a result, LinkedIn is now requiring users to log in before being able to browse the platform.
However, for many people, the most significant finding of the Ninth Circuit was that the CFAA exists to combat hacking and cannot be used as a catch-all tool for enforcing a website's ToS.
Finally, the case touches on one of the most important data and privacy issues of our time. Who actually owns our personal data? The Ninth Circuit's ruling would appear to affirm that we own our data. Any platforms we share that data with are merely licensed to use it; they don't own it outright.
Data scraping is an integral part of the modern internet ecosystem. It isn't about to go anywhere. LinkedIn's interest in pursuing HiQ may have more to do with the two companies competing to provide the same services than with any legitimate security or privacy concerns. It is worth noting that the Ninth Circuit listed a number of other potential legal remedies for businesses in LinkedIn's position. The case will now return to the district court for a trial. A lot of people will be watching developments with great interest.