computational social science

November 4, 2019

Explainable AI and computational approaches to macroeconomic theory

I have spent some time working with and around people concerned with the ethical implications of AI. A question that arises frequently in that context is to what extent automated decisions made by computational systems are “explainable” or “scrutable” (e.g. Selbst and Barocs, 2018). An important motivation for this line of inquiry is the idea that for AI systems to be effectively regulated by the Rule of Law, they need to be comprehensible to lawyers and understood within lawyerly discursive pracice (Hildebrandt, 2015). This is all very interesting, but analyses of the problem and its potential solutions rarely transcend the disciplinary silos from which the ‘explainability’ concerns originate. I’ve written my opinions about this quite a bit on this blog and I won’t reiterate them.

Instead, I’ve changed what I’m working now. Now I am contributing to open source software libraries for computational methods in macroeconomics, such as the Heterogeneous Agents Resources and toolKit (HARK). This is challenging and rewarding work. One reason why it is challenging and rewarding is how it bumps up against many key issues in the way computational methods are changing social sciences education. This is in many ways related to the explainable AI problem, though it’s in some sense the opposite side of the coin.

I’ll try to explain. Macroeconomic theory, which deals with such problems as how the economy as a whole reacts to changing trends in saving, consumption, and employment, and how agents within the economy react to those aggregate phenomena, has a long history associated with some major heavyweight economists: Keynes, Mankiw, etc. It is a deeply mathematical field that is taken seriously by central banks around the world and, by extension, private banks as well. Regulating the economy is an important job that requires expertise and is an intrinsically quantitatively understood operation; whatever one may think about the field of economics in general or its specific manifestations in history, it’s undeniable that the world needs economists of one kind or another.

So we have here a form of public policy expertise that is not discursive in the same sense that lawyerly practice is discursive. Economics has always imagined itself to be a science, however hotly contested that claim may be. It is also a field that does not shy away from having specialized disciplinary knowledge that must be accessed through demanding training. So economics would seem to be a good domain for computational methods to take root.

I’m finding that there are still challenges of interpretation in this field, but that they are somewhat different. Consider for now only the class of economic models that are built from a priori assumptions without any fitting to empirical data. Classically, economic models were constrained by their analytic tractability, meaning the ability of the economist to derive the results of the model through symbolic manipulation of the model’s mathematical terms. This led to the adoption of many assumptions of questionable realism, which have arguably led to some of the discrediting of economic theory since. But it also led to models that had closed form solutions, which have the dual advantage of being easy to compute (in terms of computational cost) and being easy to interpret, because the relationship between variables is explicit.

With computational models, the modeler has more flexibility. They can plug in the terms of the model and run a simulation to compute the result. But while the relationships between the input and output of the simulation may be observable in some sense in this case, the relationship is not proven. The simulation is not as good for purposes of exposition, or teaching, or explanation.

This is quite interesting, as it is a case where the explainability of a computational system is problematic but not because of a numeric or technical illiteracy on the part of the model reader, or of any intentional secrecy, but rather because of the complexity of the simulation (Burrell, 2016). For the purposes of this discussion, I’ve been discussing model building only, not model fitting, so the complexity in this case does not come from the noisiness of reality and the data it provides. Rather, the complexity results entirely from the internals of the model.

It is now a true word often spoken in jest that most machine learning today is some form of glorified (generalized) linear regression. The class of models considered by machine learning methods today is infinitely wide but ultimately shallow. Even when a need to understand the underlying phenomenon is abandoned, the available range of algorithms and hardware constraints limits machine-learnt models to those that are tractable by, say, a GPU.

But something else can be known.

References

Burrell, Jenna. “How the machine ‘thinks’: Understanding opacity in machine learning algorithms.” Big Data & Society 3.1 (2016): 2053951715622512.

Hildebrandt, Mireille. Smart technologies and the end (s) of law: Novel entanglements of law and technology. Edward Elgar Publishing, 2015.

Selbst, Andrew D., and Solon Barocas. “The intuitive appeal of explainable machines.” Fordham L. Rev. 87 (2018): 1085.

Leave a comment

December 13, 2017

transcending managerialism

What motivates my interest in managerialism?

It may be a bleak topic to study, but recent traffic to this post on Marcuse has reminded me of the terms to explain my intention.

For Marcuse, a purpose of scholarship is the transcendent project, whereby an earlier form of rationality and social totality are superseded by a new one that offers “a greater chance for the free development of human needs and faculties.” In order to accomplish this, it has to first “define[] the established totality in its very structure, basic tendencies, and relations”.

Managerialism, I propose, is a way of defining and articulating the established totality: they way everything in our social world (the totality) has been established. Once this is understood, it may be possible to identify a way of transcending that totality. But, the claim is, you can’t transcend what you don’t understand.

Marx had a deeply insightful analysis of capitalism and then used that to develop an idea of socialism. The subsequent century indeed saw the introduction of many socialistic ideas into the mainstream, including labor organizing and the welfare state. Now it is inadequate to consider the established totality through a traditional or orthodox Marxist lens. It doesn’t grasp how things are today.

Arguably, critiques of neoliberalism, enshrined in academic discourse since the 80’s, have the same problem. The world is different from how it was in the 80’s, and civil society has already given what it can to resist neoliberalism. So a critical perspective that uses the same tropes as those used in the 80’s is going to be part of the established totality, but not definitive of it. Hence, it will fail to live up to the demands of the transcendent project.

So we need a new theory of the totality that is adequate to the world today. It can’t look exactly like the old views.

Gilman’s theory of plutocratic insurgency is a good example of the kind of theorizing I’m talking about, but this obviously leaves a lot out. Indeed, the biggest challenge to defining the established totality is the complexity of the totality; this complexity could makes the transcendent project literally impossible. But to stop there is a tremendous cop out.

Rather, what’s needed is an explicit theorization of the way societal complexity, and society’s response to it, shape the totality in systematic ways. “Complexity” can’t be used in a fuzzy way for this to work. It has to be defined in the mathematically precise ways that the institutions that manage and create this complexity think about it. That means–and this is the hardest thing for a political or social theorist to swallow–that computer science and statistics have to be included as part of the definition of totality. Which brings us back to the promise of computational social science if and when it includes its mathematical methodological concepts into its own vocabulary of theorization.

References

Benthall, Sebastian. “Philosophy of computational social science.” Cosmos and History: The Journal of Natural and Social Philosophy 12.2 (2016): 13-30.

Gilman, Nils. “The twin insurgency.” American Interest 15 (2014).

Marcuse, Herbert. One-dimensional man: Studies in the ideology of advanced industrial society. Routledge, 2013.

Leave a comment

May 11, 2017

Hurray! Epstein’s ‘generative’ social science is ‘recursive’ or ‘effectively computable’ social science!

I’m finding recent reading on agent-based modeling profoundly refreshing. I’ve been discovering a number of writers with a level of sanity about social science and computation that I have been trying to find for years.

I’ve dipped into Joshua Epstein’s Generative Social Science: Studies in Agent-Based Computational Modeling (2007), which the author styles as a sequel to the excellent Growing Artificial Societies: Social Science from the Bottom Up (1996). Epstein explains that while the first book was a kind of “call to arms” for generative social science, the later book is a firmer and more mature theoretical argument, in the form of a compilation of research offering generative explanations for a wide variety of phenomena, including such highly pertinent ones as the emergence of social classes and norms.

What is so refreshing about reading this book is, I’ll say it again, the sanity of it.

First, it compares generative social science to other mathematical social sciences that use game theory. It notes that, though there are exceptions, the problem with these fields is their tendency to see explanation in terms of Nash equilibria of unboundedly rational agents. There’s lots of interesting social phenomena that are not in such an equilibrium–the phenomenon might itself be a dynamic one–and no social phenomenon worth mentioning has unboundedly rational agents.

This is a correct critique of naive mathematical economic modeling. But Epstein does not throw the baby out with the bathwater. He’s advocating for agent-based modeling through computer simulations.

This leads him to respond preemptively to objections. One of these responses is “The Computer is not the point”. Yes, computers are powerful tools and simulations in particular are powerful instruments. But it’s not important to the content of the social science that the simulations are being run on computers. That’s incidental. What’s important is that the simulations are fundamentally translatable into mathematical equations. This follows from basic theory of computation: every computed program is equivalent to some mathematical function. Hence, “generative social science” might as well be called “recursive social science” or “effectively computable social science”, he says; he took the term “generative” from Chomsky (i.e. “generative grammer”).

Compare this with Cederman’s account of ‘generative process theory‘ in sociology. For Cederman, generative process theory is older than the theory of computation. He locates its origin in Simmel, a contemporary of Max Weber. The gist of it is that you try to explain social phenomena by explaining the process that generates it. This is a triumphant position to take because it doesn’t have all the problems of positivism (theoretical blinders) or phenomenology (relativism).

So there is a sense in which the only thing Epstein is adding on top of this is the claim that proposed generative processes be computable. This is methodologically very open-ended, since computability is a very general mathematical property. Naturally the availability of computers for simulation makes this methodological requirement attractive, just as ‘analytic tractability’ was so important for neoclassical economic theory. But on top of its methodological attractiveness, there is also an ontological attractiveness to the theory. If one accepts what Charles Bennett calls the “physical Church theory”–the idea that the Church-Turing thesis applies not just to formal systems of computation but to all physical systems–then the foundational assumption of Epstein’s generative social science holds not just as a methodological assumption.

This was all written in 2007, two years before Lazer et al.’s “Life in the network: the coming age of computational social science“. “Computational social science”, in their view, is about the availability of data, the Internet, and the ability to look at society with a new rigor known to the hard sciences. Naturally, this is an important phenomenon. But somehow in the hype this version of computational social science became about the computers, while the underlying scientific ambition to develop a generative theory of society was lost. Computability was an essential feature of the method, but the discovery (or conjecture) that society itself is computation was lost.

But it need not be. Just a short dip into it, Epstein’s Generative social science is a fine, accessible book. All we need to do is get everybody to read it so we can all get on the same page.

References

Cederman, Lars-Erik. “Computational models of social forms: Advancing generative process theory 1.” American Journal of Sociology 110.4 (2005): 864-893.

Epstein, Joshua M., and Robert L. Axtell. “Growing artificial societies: Social science from the bottom up (complex adaptive systems).” (1996).

Epstein, Joshua M. Generative social science: Studies in agent-based computational modeling. Princeton University Press, 2006.

Lazer, David, et al. “Life in the network: the coming age of computational social science.” Science (New York, NY) 323.5915 (2009): 721.

Leave a comment

May 5, 2017

Society as object of Data Science, as Multi-Agent System, and/or Complex Adaptive System

I’m drilling down into theory about the computational modeling of social systems. In just a short amount of time trying to take this task seriously, I’ve already run into some interesting twists.

A word about my trajectory so far: my background, such as it is, has been in cognitive science and artificial intelligence, and then software engineering. For the past several years I have been training to be a ‘data scientist’, and have been successful at that. This means getting a familiarity with machine learning techniques (a subset of AI), the underlying mathematical theory, software tooling, and research methodology to get valuable insights out of unstructured or complex observational data. The data sets I’m interested are as a rule generated by some sort of sociotechnical process.

As much as the techniques of data science lead to rigorous understanding of data at hand, there’s been something missing from my toolbox, which is the appropriate modeling language for social processes that can encode the kinds of implicit theories that my analysis surfaces. Hence the transition I am attempting to go from being a data scientist, a diluted term, to a computational social scientist.

The difficulty, navigating as I am out of a very odd intellectual niche, is acquiring the theoretical vocabulary that bridges the gap between social theory and computational theory. In my training at Berkeley’s School of Information, frequently computational theory and social theory have been assumed to be at odds with each other, applying to distinct domains of inquiry. I gather that this is true elsewhere as well. I have found this division intellectually impossible to swallow myself. So now I am embarking on an independent expedition into the world of computational social theory.

One of pieces that’s grounding my study, as I’ve mentioned, is Cederman’s work outline the relationship between generative process theory, multi-agent simulations (MAS), and computational sociology. It is great work for connecting more recent developments in computational sociology with earlier forms of sociology proper. Cederman cites interesting works by R. Keith Sawyer, who goes into depth about how MAS can shed light on some of the key challenges of social theory: how does social order happen? The tricky part here is the relationship between the ‘macro’ level ‘social forms’ and the ‘micro’ level individual actions. I disagree with some of Sawyer’s analysis, but I think he does a great of setting up the problem and its relationship to other sociological work, such as Giddens’s work on structuration.

This is, so far, all theory. As a concrete example of this method, I’ve been reading Epstein and Axtell’s Growing Artificial Societies (1996), which I gather is something of a classic in the field. Their Sugarscape model is very flexible and their simulations shed light on timeless questions of the relationship between economic activity and inequality. Their presentation is also inspiring.

As a rule I’m finding the literature in this space far more accessible than I would have expected. It’s often written in very plain language and depends more on the power of illustration than scientific terminology laden with intellectual authority. What I have encountered so far is, perhaps as a consequence, a little unsatisfying intellectually. But it’s all quite promising.

Based on these leads, I was recommended David Little’s recent blog post about complexity in social science. He’s quite critical of the bolder claims of these scientists; I’d like to revisit these arguments later. But what was most valuable for me were his references. One was a book by Epstein, who I gather has gone on to do a lot more work since co-authoring Growing Artificial Societies. This seems to continue in the vein of ‘generative’ modeling shared by Cederman.

But Little references two other sources: John Holland’s Complexity: A Very Short Introduction and Miller and Page’s Complex Adaptive Systems: An Introduction to Computational Models of Social Life.

This is actually a twist. Holland as well as Miller and Page appear to be concerned mainly with complex adaptive systems (CAS), which appear to be more general than MAS. At least, in Holland’s rendition, which I’m now reading. MAS, Cederman and Sawyer both argue, is inspired in part by Object Oriented Programming (OOP), a programming paradigm that truly does lend itself to certain kinds of simulations. But Holland’s work seems more ambitious, tying CAS back to contributions made by von Neumman and Noam Chomsky. Holland is after a general scientific theory of complexity, not a specific science of modeling social phenomena. Perhaps for this reason his work echoes some work I’ve seen in systems ecology on autocatalysis and Varela’s work on autopoiesis.

Indeed the thread of Varela may well lead to where I’m going. One paper I’ve seen ties computational sociology to Luhmann’s theory of communication; Luhmann drew on Varela’s ideas of autopoeisis explicitly. So there is likely a firm foundation for social theory somewhere in here.

These are fruitful investigations. What I’m wondering now is to what extent the literatures on MAS and CAS are divergent.

Leave a comment

April 23, 2017

Process theory; generative epistemology; configurative ontology: notes on Cederman, part 1

I’ve recently had recommended to me the work of L.E. Cederman, who I’ve come to understand is a well-respected and significant figure in computational social science, especially agent based modeling. In particular, I’ve been referred to this paper on the theoretical foundations of computational sociology:

Cederman, L.E., 2005. Computational models of social forms: Advancing generative process theory 1. American Journal of Sociology, 110(4), pp.864-893. (link)

This is a paper I wish I had encountered years ago. I’ve written much here about my struggles with “interdisciplinary” research. In short: I’ve been trying to study social phenomena with scientific rigor. This is a very old problem fraught with division. On top of that, there’s been, it seems, an epistemological upset because of advances in data collection and processing that poses a practical challenge to a lot of established disciplines. On top of this, the social phenomena I’m interested in most tend to involve the interaction between people and technology, which brings with it an association with disciplines specialized to that domain (HCI, STS) that for me have not made my research any more straightforward. After trying for some time to do the work I wanted to do under the new heading of data science, I did not find what I was looking intellectually in that emerging field, however important the practical skill-set involved has been to me.

Computational social science, I’ve convinced myself if not others, is where the answers lie. My hope for it is that as a new discipline, it’s able to break away from dogmas that limited other disciplines and trapped their ambitions in endless methodological debates. What is being offered, I’ve imagined, in computational social science is the possibility of a new paradigm, or at least a viable alternative one. Cederman’s 2005 paper holds out the promise for just that.

Let me address for now some highlights of his vision of social science and how they relate to the other. I hope to come to the rest in a later post.

Sociological process theory. This is a position in sociological theory that Cederman attributes to 19th century sociologist Georg Simmel. The core of this position is that social reality is not fixed, but rather result of an ongoing process of social interactions that give rise to social forms.

“The large systems and the super-individual organizations that customarily come to mind when we think of society, are nothing but immediate interactions that occur among men constantly every minute, but that have become crystallized as permanent fields, as autonomous phenomena.” (Simmel quoted in Wolf 1950, quoted in Cederman 2005)

There is a lot to this claim. If one is coming from the field of Human Computer Interaction (HCI), what may seem most striking about it is how well it resonates with a scholarly tradition that is most frequently positioned as a countercurrent to an unthinking positivism in design. Lucy Suchman, Etienne Wenger, and Jean Lave are scholars that come to mind as representative of this way of thinking. Much of the intellectual thrust of Simmel can be found in Paul Dourish’s criticism of positivist understandings of “context” in HCI.

For Dourish, the intellectual ground of this position is phenomenological social science, often associated with ethnomethodology. Simmel predates phenomenology but is a neo-Kantian, a contemporary of Weber, and a critic of the positivism of his day (the original positivism). As a social scientific tradition, it has had its successors (maybe most notably George Herbert Mead) but has submerged under other theoretical traditions. From Cederman’s analysis, one gathers that this is largely due to process theory’s inability to ground itself in rigorous method. Its early proponents were fond of metaphorical writing in a way that didn’t age well. Cederman pays homage to the sociological process theory’s origins, but quickly moves to discuss an epistemological position that complements it. Notably, this position is neither positivist, nor phenomenological, nor critical (in the Frankfurt School sense), but something else: generative epistemology.

Generative epistemology. Cederman positions generative epistemology primarily in opposition to positivism and particularly a facet of positivism that he calls “nomothetic explanation”: explanation in terms of laws and regularities. The latter is considered the gold standard of natural science and the social sciences that attempt to mimic them. This tendency is independent of whether the inquiry is qualitative or quantitative. Both comparative analysis and statistical control look for a conjunction of factors that is regularly predictive of some outcome. (Cederman’s sources on this: (Gary) King, Keohane, and Verba (1994), and Goldthorpe, 1997. The Gary King cited is I assume the same Gary King who goes on to run Harvard’s IQSS; I hope to return to this question of positivism in computational social science in later writing. I tend to disagree with the idea that ‘data science’ or ‘big data’ has primarily a positivist tendency.)

Cederman describes the ‘process theorist’s’ alternative as based on abduction, not induction. Recall that ‘abduction’ was Peirce’s term for ‘inference to the best explanation’. The goal is to take an observed sociological phenomenon and explain its generation by accounting for how it is socially produced. The preference for generative explanation, in Simmel, comes in part from a pessimism about isolating regularities in complex social systems. Through this theorization, knowledge is gained; the knowledge gained is a theoretical advance that makes a social phenomenon less ‘puzzling’.

“The construction of generative explanations based on abductive inference is an inherently theoretical endeavor (McMullin, 1964). Instead of subsuming observations under laws, the main explanatory goal is to make a puzzling phenomenon less puzzling, something that inevitably requires the introduction of new knowledge through theoretical innovation.”

The specifics of the associated method are less clear than the motivation for this epistemology. Many early process theorists resorted to metaphors. But where all this is going is into the construction of models, and especially computational models, as a way of presenting and testing generative theories. Models generate forms through logical operations based on a number of parameters. A comparison between the logical form and the empirical form is made. If it favorable, then the empirical form can be characterized as the result of a process described by the variables and model. (Barth, 1981)

Cederman draws from Barth (1981) and Thomas Fararo (1989) to ally himself with ‘realist’ social science. The term is clarified later: ‘realism’ is opposed to ‘instrumentalism’, a reference that cuts to one of core epistemological debates in computational methods. An instrumental method, such as a machine learning ensemble, may provide a very instrumental model for purposes of prediction and control that nevertheless does not capture what’s really going on in the underlying process. Realist mathematical sociology, on the other hand, attempts to capture the reality of the process generating the social phenomenon in the precise language of processing, mathematics/computation. The underlying metaphysical point is one that many people would rather not attend to. For now, we will follow Cederman’s logic to a different ontological point.

Configurative ontology. Sociological process theory requires explanations to be specify the process that generates the social form observed. The entities, relations, and mechanisms may be unobserved or even unobservable. Postivists, Cederman argues, will often take the social forms to be variables themselves and undertheorize how the variables have been generated, since they care only about predicting actual outcomes. Whereas positivists study ‘correlations’ among elements, Simmel studies ‘sociations’, the interactions that result in those elements. The ontology, then, is that social forms are “configurations of social interactions and actors that together constitute the structures in which they are embedded.”

In this view, variables, such as would be used in some more positivist social scientific study, “merely measure dimensions of social forms; they cannot represent the forms themselves except in very simple cases.” While a variable based analysis detaches a social phenomenon from space and time, “social forms always possess a duration in time and an extension in space.

Aside from a deep resonance with Dourish’s critique of ‘contextual computing’ (noted above), this argument once again recalls much of what now comes under the expansive notion of ‘criticism’ of social sciences. Ethnomethodology and ethnography more general are now often raised as an alternative to simplistic positivist methods. In my experience at Berkeley and exposure so far to the important academic debates, the most noisy contest is between allegedly positivist or instrumentalist (they are different, surely) quantitative methods and phenomenological ethnographic methods. Indeed, it is the latter who more often now claim the mantle of ‘realism’. What is different about Cederman’s case in this paper is that he is setting up a foundation for realist sociology that is nevertheless mathematized and computational.

What I am looking for in this paper, and haven’t found yet, is an account of how these ‘realist’ models of social processes are tested for their correspondence to empirical social form. Here is where I believe there is an opportunity that I have not yet seen fully engaged.

Leave a comment

January 23, 2017

update: no longer think “hacker class consciousness” is important

I’m going through old papers and throwing them out. I came upon an early draft from my first year in graduate school titled “Hacker Class Consciousness”. It was the beginning of an argument that those that work on open source software needed to develop a kind of class consciousness recognizing that their work bears a special relationship to capitalist modes of production. Open source software is a form of capital (a means of production) that is not privately owned. Hence, it is actually quite disruptive to capitalism per se. A la early Marxist theory, a political identity or “class consciousness” of people working in this way was necessary to reform the government to make it more equitable, or environmentally friendly, less violent, or whatever your critique of capitalism (or neoliberalism, if you prefer) is.

I didn’t get very far past this basic economic logic, which I still think is correct. I no longer think that class consciousness is important though. And I don’t think there’s an inevitability to capitalism containing the seeds of its own revolution through the eventual triumph of open source production.

I think it’s a good practice to make oneself accountable when one changes ones mind. There’s lots of evidence to say that when people publicly commit to some belief, they wind up sticking to it with more confidence than they ought to. Shame related reasons, I suppose. A good alternative habit, I believe, is publicly admitting when you are wrong about something, with the reasons for the update.

So why did I change my mind on this? Well, one reason is that I took some shots at formally modelling the problem several years ago and while it showed the robustness of open source software as a way of opening a market that had previously been dominated or locked in by a proprietary vendor or solution, there isn’t the profit motive driving open source production as a first mover. So the natural pressures of the market make open source coexist alongside proprietary systems, providing a countervailing force to privatization but never dissolving it entirely.

Another reason I changed my mind was a more general shift away from Marxist to Bourdieusian modes of thinking, which I’ve talked about here. A key part of this change in perspective is that it sees many kinds of capital at work in society, including both economic and cultural forms, and populations are distributed across the resulting multidimensional spectrum of variation, not stratified into a one-dimensional class structure. In such a world, class consciousness is futile. This futility may explain the futility of the Marxist project in general, as there was never really the kind of global collective action of the proletariat that he predicted would end capitalism. There’s always too many other kinds of population difference at work to allow for such a revolution. Race, for example.

It is good that a matured attitude has left me less eager to engage in a futile revolutionary project. There’s nothing like pursuing a doctorate for grinding that kind of idealism out of you. Now I can scintillate with cynicism, and would like to be much better at it. Which is to say, I’m beginning to regret ever turning away from the dismal science of economics, which now seems much more like the doctrine worth pursuing and improving.

One nice thing about economics is that it is quantitatively rigorous. This is not simply an intellectual gate-keeping statement designed to box out the innumerate. It’s rather a comment on how such a field has strictly more expressive power because of its capacity to represent a statistical distribution of variation. It’s not enough to say there’s black and white when there are shades of gray. And it’s not enough to say there are shades of gray when the particular variation in density of light across the field is what’s important.

A grayscale raster, from the OpenGeo Suite

It’s this kind of expressive power that gives computational social science much of its appeal. I forgot to even make this argument in my paper about the subject. That may be because this notion of the expressive power of different representational systems is part of what one learns in the course of ones computer science education, and that argument was written primarily for people without a computer science education.

Which really brings the discussion back around to where I come down to on the revolutionary economic potential of software development. Which is that really, it’s about educating people in the concepts and skills that allow them to make use of this incredible pool of openly available technical capital that gives people the “class consciousness” to act with it. Since late modern software development depends for its very existence on the great open wealth of collectivized logic already crystallized into free code, the “consciousness” is really just the habitus of the developer. I suppose I occasionally meet somebody who says they’ve been coding in .NET for their whole careers, but they are rare and I think are not doing well in the greater information economy.

It no coincidence that technical education and skills diffusion are, for Thomas Piketty, the way to counteract the inequality the results from disparate returns on wealth versus labor. This is a position one simply converges on if one studies it for long enough. Kindly, it stabilizes the role of the education system as one that is necessary for correcting other forms of societal destabilization and excess.

Leave a comment

December 23, 2015

Bourdieu and the possibility of interdisciplinary social science research

Bourdieu (Science of Science and Reflexivity, 2004) is interested in an account of science that has both sociological realism and trans-historical legitimacy. The importance of this project is obvious to any acting scientist who both contends with their social reality and aims to discover trans-historical knowledge. Trans-historical knowledge is incentivized by specific social institutions that create and preserve symbolic capital for scientists precisely according to the principle that their discoveries survive the test of time. If the knowledge survives only because it is propped up by temporal institutions that do not have such transcendent aspirations, then it is by definition not science.

There are all sorts of other academic vocations that do not have these transcendent aspirations, especially in what are broadly considered the social sciences. These include: ethnographers who explicitly do not aim for their results to generalize, historians who explicitly aim to elucidate the historical contingency and context of their objects of study, researchers who study organizations with the intention to inform their audience of matters of immediate political interest, and writers who offer a contextualized critique of an aspect of society in light of a tradition of scholarly literature. These vocations are not scientific, in Bourdieu’s sense, because they are not participating in a social field whose self-declared purpose is the discovery of trans-historical truth.

Rather, these researchers participate in other social fields, called “disciplines”. Because unscientific disciplines do not aspire to trans-historical knowledge, they see nothing wrong with carrying out research that is consistent with the contingent norms of their social environment, despite knowing full well that these norms stifle complete understanding of their phenomena of interest. Indeed there may be nothing wrong with this except from the perspective of a scientist judging this activity through the criteria of science. These disciplines attempt to accomplish the permanence of their symbolic capital through reproduction of their discipline specifically, as opposed to the reproduction of scientific method and knowledge generally.

If you go around an interdisciplinary context in a university and start telling non-scientists “You are not a scientist!” one is likely to elicit an affronted reaction. This is due to “established divisions in the long running debates about scientific method and the legitimacy of social science and humanistic inquiry,” and the resulting disciplinary hierarchy. Because science proper is hierarchically “above” social science and humanistic inquiry, pointing out that somebody is not a scientist is often interpreted as rude in the uniquely touchy culture of the academy. Researchers who are not scientists will deploy any number of strategies to recover status in this mixed social field, including: declaring themselves to be scientists (according to a more relaxed standard); declaring the distinction between science and non-science to be epistemically illegitimate (thereby weakening the status of science per se); and appealing to broader democratic principles of social inclusion and equality to motivate their inclusion within the scientific field.

However valuable democratic inclusion may be, appeals to it are not like the other strategies which are directed at the demarcation problem (the question of “what is science?” and by extension “what is not science?”) directly. My own opinion is that scientific inclusion is both very important and best achieved through good and equitably provided scientific education, and that good scientific education includes a transmission of scientific demarcation. In other words, because of the importance of social inclusion in science, it is essential to be be clear about what kinds of activities and knowledge science excludes. To broadly include people, for democratic reasons, into a social field that is in fact not science does not accomplish the inclusiveness of science; it does something else. So in the interest of the democratic inclusivity of science I will continue to elaborate on the social challenges of scientific demarcation despite how rude or otherwise objectionable this line of inquiry is to many scholars who are not scientists.

Above I have contrasted scientific research, which participates in a generalized social field aimed specifically at transcending temporally and geographically locality, and disciplinary social research, which is aimed at the reproduction of a specific social field. This contrast is drawn in multiple dimensions, but these dimensions are not orthogonal. As this can be confusing, I will attempt to untie these threads.

There is the distinction between scientific research and social research, which will immediately be recognized as a false dichotomy. Perhaps because of the strategic blurring of scientific demarcation mentioned above, the term “social science” is used problematically to mean both scientific and unscientific research into social phenomena. The hierarchy of the “social sciences” (economics, political science, sociology, anthropology) reflects the amount to which these disciplines adopt scientific methods. Scientific methods depend on scientific instruments developed using the discoveries of the exact sciences (such as mathematics, statistics, and foundational computer science). Because of this, we have seen more and more non-social sciences being applied to social phenomena, further confusing the idea of “social science”.

To clarify this problem, it is therefore useful to discuss “social research” broadly, and then address separately the question of how scientific a discovery or discipline of social research is. As should be clear from the preceding discussion, part of what makes social research more scientific is its ability to transcend its specific disciplinary context and be integrated in a generalized scientific field that aims specifically at that transcendence.

“Interdisciplinary” social research, therefore, will be easiest when the disciplines involved are more scientific, because the scientific imperative is precisely to transcend disciplinary and other local constraints. The less scientific a discipline is, the more it will resist interdisciplinary integration because it will not be serving the function of disciplinary reproduction.

This analysis clarifies why “interdisciplinary” social research is so highly sought after but so rarely achieved. This is because it is sought after by disciplines for conflicting reasons. A more scientific discipline will be motivated to interdisciplinary work out of its native purpose to transcend its own historical constraints, assimilating into itself the specific insights of a historically specific discipline while excluding the contingent elements. The less scientific discipline will, in contrast, pursue interdisciplinary research in order to blur scientific demarcation but will vigorously maintain its historical specificity in spite of the scientific imperative.

Today we see the profound success of interdisciplinary research, and interdisciplinary social research in particular, in ‘data science’, a term whose ambiguity signals the contiguity of all disciplines that are sufficiently scientific. The globalized field of data science, enabled largely through the sharing of software source code that operates identically across many and various contexts, transcends especially the contexts of academy, industry, and government. To the data scientist, discipline is irrelevant once it is subsumed by science.

This presents a crisis to disciplinary social research: either it must become “interdisciplinary” with data science, losing its disciplinary specificity. Or it must maintain its disciplinary integrity and autonomy at the expense of its trans-historical permanence as historical conditions change with the rise of data science. With either option, the disciplinary social sciences face their own mortality.

3 Comments

Tag: computational social science