Svoboda | Graniru | BBC Russia | Golosameriki | Facebook
skip to main content
10.1145/1401890.1401959acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Discrimination-aware data mining

Published: 24 August 2008 Publication History
  • Get Citation Alerts
  • Abstract

    In the context of civil rights law, discrimination refers to unfair or unequal treatment of people based on membership to a category or a minority, without regard to individual merit. Rules extracted from databases by data mining techniques, such as classification or association rules, when used for decision tasks such as benefit or credit approval, can be discriminatory in the above sense. In this paper, the notion of discriminatory classification rules is introduced and studied. Providing a guarantee of non-discrimination is shown to be a non trivial task. A naive approach, like taking away all discriminatory attributes, is shown to be not enough when other background knowledge is available. Our approach leads to a precise formulation of the redlining problem along with a formal result relating discriminatory rules with apparently safe ones by means of background knowledge. An empirical assessment of the results on the German credit dataset is also provided.

    References

    [1]
    R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In Proc. of VLDB 1994, pages 487--499. Morgan Kaufmann, 1994.
    [2]
    R. Agrawal and R. Srikant. Privacy-preserving data mining. In Proc. of SIGMOD 2000, pages 439--450. ACM, 2000.
    [3]
    G. S. Becker. The Economics of Discrimination. University of Chicago Press, 1957.
    [4]
    C. Clifton. Privacy preserving data mining: How do we mine data when we aren't allowed to see it? Tutorial at KDD 2003. http://www.cs.purdue.edu/homes/clifton.
    [5]
    B. Goethals. Frequent Itemset Mining Implementations Repository, http://fimi.cs.helsinki.fi.
    [6]
    H. Holzer, S. Raphael, and M. Stoll. Black job applicants and the hiring officer's race. Industrial and Labor Relations Review, 57(2):267--287, 2004.
    [7]
    D.H. Kaye and M. Aickin, editors. Statistical Methods in Discrimination Litigation. Marcel Dekker, Inc., 1992.
    [8]
    P. Kuhn. Sex discrimination in labor markets: The role of statistical evidence. The American Economic Review, 77:567--583, 1987.
    [9]
    M. LaCour-Little. Discrimination in mortgage lending: A critical review of the literature. J. of Real Estate Literature, 7:15--50, 1999.
    [10]
    B. Liu, W. Hsu, and Y. Ma. Integrating classification and association rule mining. In Proc. of KDD 1998, pages 80--86. AAAI Press, 1998.
    [11]
    D.J. Newman, S. Hettich, C.L. Blake, and C.J. Merz. UCI repository of machine learning databases, 1998. http://archive.ics.uci.edu/ml.
    [12]
    D. Pedreschi, S. Ruggieri, and F. Turini. Discrimination-aware data mining. Tech. Rep. 07-19, Dip. Inf., Univ. of Pisa, 2007. http://compass2.di.unipi.it/TR.
    [13]
    J. Rauch and M. Simunek. Mining for association rules by 4ft-Miner. In Proc. of INAP 2001, pages 285--295. Prolog Association of Japan, 2001. http://lispminer.vse.cz.
    [14]
    G. D. Squires. Racial profiling, insurance style: Insurance redlining and the uneven development of metropolitan areas. J. of Urban Affairs, 25(4):391--410, 2003.
    [15]
    L. Sweeney. Computational Disclosure Control: A Primer on Data Privacy Protection. PhD thesis, MIT, 2001.
    [16]
    P.-N. Tan, V. Kumar, and J. Srivastava. Selecting the right objective measure for association analysis. Inf. Syst., 29(4):293--313, 2004.
    [17]
    U.S. Federal Legislation. http://www.usdoj.gov.
    [18]
    X. Yin and J. Han. CPAR: Classification based on Predictive Association Rules. In Proc. of SIAM DM 2003, SIAM, 2003.

    Cited By

    View all
    • (2024)Metric-Independent Mitigation of Unpredefined Bias in Machine ClassificationIntelligent Computing10.34133/icomputing.00833Online publication date: 8-Apr-2024
    • (2024)How can Consumers Without Credit History Benefit from the Use of Information Processing and Machine Learning Tools by Financial Institutions?SSRN Electronic Journal10.2139/ssrn.4730445Online publication date: 2024
    • (2024)Goal Orientation for Fair Machine Learning AlgorithmsProduction and Operations Management10.1177/10591478241234998Online publication date: 18-Mar-2024
    • Show More Cited By

    Index Terms

    1. Discrimination-aware data mining

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2008
      1116 pages
      ISBN:9781605581934
      DOI:10.1145/1401890
      • General Chair:
      • Ying Li,
      • Program Chairs:
      • Bing Liu,
      • Sunita Sarawagi
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 August 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. classification rules
      2. discrimination

      Qualifiers

      • Research-article

      Conference

      KDD08

      Acceptance Rates

      KDD '08 Paper Acceptance Rate 118 of 593 submissions, 20%;
      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Upcoming Conference

      KDD '24

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)208
      • Downloads (Last 6 weeks)13
      Reflects downloads up to 31 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Metric-Independent Mitigation of Unpredefined Bias in Machine ClassificationIntelligent Computing10.34133/icomputing.00833Online publication date: 8-Apr-2024
      • (2024)How can Consumers Without Credit History Benefit from the Use of Information Processing and Machine Learning Tools by Financial Institutions?SSRN Electronic Journal10.2139/ssrn.4730445Online publication date: 2024
      • (2024)Goal Orientation for Fair Machine Learning AlgorithmsProduction and Operations Management10.1177/10591478241234998Online publication date: 18-Mar-2024
      • (2024)New Digital Divide Shaped by Algorithm? Evidence from Agent-Based Testing on Douyin’s Health-Related Video RecommendationCommunication Research10.1177/00936502241262056Online publication date: 30-Jul-2024
      • (2024)Average User-Side Counterfactual Fairness for Collaborative FilteringACM Transactions on Information Systems10.1145/365663942:5(1-26)Online publication date: 13-May-2024
      • (2024)Fairness Testing: A Comprehensive Survey and Analysis of TrendsACM Transactions on Software Engineering and Methodology10.1145/365215533:5(1-59)Online publication date: 4-Jun-2024
      • (2024)The Impact of Explanations on Fairness in Human-AI Decision-Making: Protected vs Proxy FeaturesProceedings of the 29th International Conference on Intelligent User Interfaces10.1145/3640543.3645210(155-180)Online publication date: 18-Mar-2024
      • (2024)Towards Environmentally Equitable AI via Geographical Load BalancingProceedings of the 15th ACM International Conference on Future and Sustainable Energy Systems10.1145/3632775.3661938(291-307)Online publication date: 4-Jun-2024
      • (2024)Balancing Act: Evaluating People’s Perceptions of Fair Ranking MetricsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3659018(1940-1970)Online publication date: 3-Jun-2024
      • (2024)Fairness-Aware Exposure Allocation via Adaptive RerankingProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657794(1504-1513)Online publication date: 10-Jul-2024
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media