Rational Inattention and Organizational Focus

October 30, 2017 | Author: Anonymous | Category: N/A

Share Embed

Report this link

Short Description

Walter Isaacson's biography of Steve Jobs, on the. walter isaacson rational number ......

Description

Rational Inattention and Organizational Focus Wouter Dessein

Andrea Galeotti

Tano Santos

Columbia University

University of Essex

Columbia University

June 4, 2014

Abstract We examine the allocation of attention in team production. Coordinated adaptation to task-speci…c shocks requires that agents pay attention to each other, but attention is scarce. At the optimum, all attention is evenly allocated to a select number of “leaders”. The organization excels in a small number of focal tasks at the expense of all others. Our results shed light on the importance of leadership, strategy and “core competencies,”as well as new trends in organization design. We also derive implications for the optimal size of organizations. Surprisingly, improvements in communication technology may result in smaller but more adaptive organizations.

We thank Ricardo Alonso, Arthur Campbell, Roland Benabou, Luis Garicano, Navin Kartik, Jin Li, and Tim Van Zandt for helpful comments, as well as seminar and conference audiences at Brown, Columbia GSB, Essex, EUI, Harvard Business School, Northwestern Kellogg, MIT Sloan, Paris School of Economics, Pompeu Fabre, Princeton, Queens, Rochester, Toronto, Tsinghua University, UCL, ULB, Warwick, the CEPR Conference on Incentive, Management and Organizations (Milan), ESEM Malaga, ESSET Gerzensee, and NYU Stern IO Day. Andrea Galeotti is grateful to the European Research Council for support through ERC-starting grant (award no. 283454) and to The Leverhulme Trust for support through the Philip Leverhulme Prize.

Economics, according to Lionel Robbins’s famous dictum, is “the science which studies human behavior as a relationship between ends and scarce means which have alternative uses.”As emphasized by Herbert Simon, attention may well be the ultimate scarce resource in the economy: “a wealth of information creates a poverty of attention and a need to allocate that attention e¢ ciently among the overabundance of information sources,” (Simon 1971, pp. 40–41). This paper studies the optimal allocation of (scarce) attention in organizations. Despite a fully symmetric environment and decreasing marginal returns to attention, we show that when attention is scarce, organizational focus and leadership naturally arise as a response to organizational trade-o¤s between coordination and adaptation. As such, our results provide micro-foundations for a central idea in the management literature that …rms should focus on a limited set of “core competencies”(Prahalad and Hamel 1990)1 and …rms that aim to be “all things to all people,” will be “caught in the middle” and fail (Porter 1985;1996). We develop comparative statics as to when organizational focus is more or less important, and shed light on new trends in organizational design including more shared leadership, an increase in horizontal communication linkages, and a reduction in the overall size of the organization (downsizing and downscoping).2 Our starting point is that organizations have a limited communication capacity and must use this resource judiciously in order to coordinate their production processes.3 We propose a model of team production in which a number of complementary tasks, such as engineering, purchasing, manufacturing, marketing and selling must be implemented in a coordinated fashion. Alternatively, di¤erent tasks may correspond to di¤erent products or locations (e.g. a multi-product …rm which exploits economies of scope). Each agent is in charge of one task and must adapt this task to local information or “shocks”. Such adaptation, however, results in coordination failures with other tasks unless agents communicate e¤ectively. Organizational focus, then, takes the form of allocating more scarce organizational attention to one task –or one agent –than another task. 1

According to Prahalad and Hamel, such core competences represent the “collective learning in the

organization, especially how to coordinate diverse production skills” (p. 82) 2 See Whittington et al. (1999), Guadalupe et al. (2012), and Roberts and Saloner (2013). 3 As Arrow (1974, p.53) stated “The information has to be coordinated if it is to be of any use to the organization. More formally stated, communication channels have to be created within the organization.”

1

If attention is abundant there is no need for organizational focus in our model. All tasks can then be very adaptive and well coordinated, and it is optimal to distribute attention evenly. In contrast, if attention is scarce and coordination is important, it is optimal to treat tasks asymmetrically. A few agents should then be allowed to be very responsive to their local information, and all attention should be focused on those agents and their tasks in order to avoid coordination failures. In contrast, coordination with all other tasks is achieved by limiting their adaptiveness. All tasks are then well coordinated, but only a few tasks are adaptive.4 Leadership, where a few agents monopolize scarce attention and take most of the initiative, arises endogenously. In contrast, a “balanced”organization that spreads attention evenly across tasks is “stuck in the middle”: tasks are neither very adaptive nor are they very well coordinated. Importantly, the above result obtains even when all tasks and agents are ex ante identical and there are decreasing marginal returns to attention: as in most standard models of information acquisition, we assume that every additional unit of attention allocated to an agent reduces the uncertainty about this agent’s actions by an ever smaller amount. The mechanism underlying the above result is a fundamental complementarity between the attention devoted to an agent, and the initiative taken by this agent. Agents take initiative by adapting their task to local information. But agents who are ignored by others are forced to also largely ignore their own private information, as taking initiative would then result in substantial coordination failures. Conversely, it is a waste of resources to allocate scarce attention to an agent who takes little or no initiative. Following the same logic, the more attention an agent receives, the more initiative this agent can take, and the more important it is to devote scarce attention to this agent in order to ensure coordination. Because of the above complementarities, members in an organization either communicate intensively about a particular task, or they ignore it. As we show, an optimal organization then equally divides all attention among a select number of tasks or agents, which we refer 4

This correspond to the two general ways in which organizations can be coordinated according to March

and Simon (1958): “The type of coordination used in the organization is a function of the extent to which the situation is standardized. (...) We may label coordination based on pre-established schedules coordination by plan, and coordination that involves transmission of new information coordination by feedback. The more stable and predictable the situation, the greater the reliance on coordination by plan.” (p182)

2

to as ‘leaders’. The scarcer is attention, the smaller is the number of tasks on which the organization focuses. Interestingly, those chosen tasks then often receive much more attention – and are much more adaptive – than if attention were to be abundant. Our results support the notion that firms need to have a clear strategy – they must choose a set of performance dimensions or tasks in the value-chain to focus on.5 By the same token, we provide insights as to how focused firms should be. Over the last decades, there have been enormous technological innovations in communication and coordination technologies (E-mail, wireless communication and computing, intra networks, etc.). Our results suggest that having a narrow focus becomes less important as information technology relaxes the communication and attention constraints of organizations. The resulting organization is often less well coordinated, less cohesive, but has a broader focus – it pays attention to the task-specific information of a larger number of agents. This is consistent with new trends in organizational design towards more network-like organizations where communication flows are horizontal rather than vertical, and decision-making and influence is broadly shared in the organization. Such novel organizations have been documented in both case studies (for example “Proctor & Gamble Organization 2005,” HBS case 9-707-519)6 and large scale empirical studies (Whittington et al. 1999, Guadalupe et al. 2012).7 Conversely, in fastmoving environments where speedy decisions are important – a competitive market place where reacting quickly to a competitor’s move is of the essence, or a platoon in the field of battle – there is often little time for extensive communication.8 Our model predicts that 5 6

See Van den Steen (2013) for a different view and formalization of “what is strategy”. In this case study, Piskorski and Spadini document how P&G has moved towards a novel organizational

structure in which a separate product organization (responsible for global marketing and product development), a sales organization (responsible for delivery and customization to local markets), and a business services organization are interdependent units who are giving equal weight in decision-making processes, and achieve coordination through social networks and horizontal communication, rather than vertical authority relationships. In the past, geographically organized sales organizations dominated P&G, slowing down the development and roll-out of new products. 7 Guadalupe et al. document how, in recent decades, C-level executive teams in Fortune 500 firms have almost doubled in size, mainly because of the inclusion of more functional managers. 8 As argued by Roberts and Saloner (2013), increased competition, through globalization of markets and industries, “requires firms to be able to change more quickly and respond faster to market developments.”

3

leadership and organizational focus is more important in such environments compared to settings where extensive communication is feasible prior to taking action. In our basic model, the size or “scope” of the organization is fixed. In multi-product firms, however, different tasks may correspond to different types of products or services, and it is natural to think of the number of tasks as being endogenous. In Section 4, we therefore endogenize the number of tasks by introducing economies of scale or scope: certain fixed costs can be shared among tasks (e.g. production facilities or a distribution network), yielding benefits to size. The size of organizations, however, is limited by the need for coordination and limited organizational attention. When the number of tasks is endogenous, we uncover an important trade-off between organizational size and organizational focus. As we show, smaller organizations have more leaders and, hence, the information of more agents is reflected in decision-making. As an organization grows larger, leadership becomes more concentrated as there is more need for coordination. This is consistent with the experience of many entrepreneurial firms, whose culture of joint-decision-making and open lateral communication often disappears as they grow larger and more hierarchical.9 Two empirically relevant drivers of organizational size are the volatility of the environment and changes in information and communication technologies. Consistent with recent trends in “de-scoping”(Whittington et al. 1999), we show that the optimal scope of organizations decreases as the environment becomes more volatile and adaptation becomes more important.10 Intuitively, by reducing the number of tasks that it undertakes, the organization reduces its coordination needs, hence allowing for better adaptation. At the same time, the number of tasks that receive attention increases. Hence, as the environment becomes more volatile, there is a move from large, focussed organizations that maximize scope economies to smaller, but more adaptive and balanced organizations. Improvements in information (p822) 9 In a classic management article, Greiner (1972) discusses how young, creative organizations often face a crisis of leadership as they grow bigger. According to Greiner, an initial phase of creativity must give way to a phase of direction, where companies that continue to grow, do so under “able, directive leadership”. 10 As noted by Siggelkow and Rivkin (2005), ”rapid technological change, deregulation, and globalization have intensified competition and increased the turbulence that managers face, forcing them to adopt new, more responsive organizational forms.” (p101) See also footnote 8.

4

technology might be conjectured to always increase the size of organizations, as they allow for better coordination. Interestingly, we show that information technology has a decidedly ambiguous impact on firm scope. Intuitively, information technology makes it optimal for organizations to shift towards a strategy that emphasizes adaptation to its environment, but smaller firms with shared leadership are better configured to do so. Hence, while for low levels of information technology, large, non-adaptive firms exploiting economies of scale are optimal, for intermediate levels of information technology, smaller, more flexible firms are often preferred. Outline. After reviewing the related literature in Section 1, we describe our model in Section 2. Section 3 provides a characterization of the optimal allocation of attention. Section 4 endogenizes the number of tasks, with larger organizations exploiting economies of scope but facing more daunting coordination problems. We conclude in Section 5 by discussing the implications of our model for skill-heterogeneity and the entrenchment of leadership and functional cultures in organizations, as well as some missing elements such as power struggles and conflicts of interests. Proofs of Propositions, as well as model extensions, are relegated to the Appendix.

1

Literature Review

Our paper is part of a larger literature on team theory (Marschak and Radner 1972, Radner 1993), which studies games where agents share the same objective, but have asymmetric information. As Garicano and Van Zandt (2013) emphasize, team theory is a natural framework in which to analyze the questions raised in this paper, as these models study interrelated decision-making when information is dispersed and there are limits to communication, a canonical organizational problem.11 Our paper further belongs to a recent literature in organizational economics, most prominently Dessein and Santos (2006) (DS hereafter), Alonso et al. (2008), Rantakari (2008), and Bolton, Brunnermeir and Veldkamp (2013), that emphasizes coordination adaptation 11

See, e.g., Cremer (1980, 1993), Geanakoplos and Milgrom (1991), Prat (2002), Wernerfelt (2004), Hart

and Moore (2005), Cremer et al. (2007) and Alonso et al. (2013).

5

trade-offs as a key mechanism determining organizational form. Following Marshak and Radner (1972), all these papers study how organizations are optimally designed to coordinate production and assume a similar quadratic pay-off structure as our paper. Closest to us is DS, which studies optimal task specialization in organizations, but restricts communication flows to be symmetric; instead the present paper takes task-specialization as given and endogenizes communication patterns.12 Calvo-Armagenol, de Marti and Prat (2011) also endogenizes communication patterns in a framework similar to that of DS. Their focus, however, is on how asymmetries in pay-off externalities between pairs of agents result in asymmetric communication flows and differential influence for agents.13 Alonso et al. (2008) and Rantakari (2008) depart from the team theory formulation in DS by introducing incentive conflicts and by modelling communication as cheap talk.14 Cheap talk communication is not suitable to study the allocation of attention, however, as the quality of communication only depends on incentive conflicts between agents, and not on time or information-processing constraints. The focus of these papers is on the optimal allocation of decision-making authority whereas the present paper is concerned with the optimal allocation of organizational attention.15 A number of other papers study the optimal characteristics of leaders in adaptationcoordination models where economic actors must respond to a common “global” shock, but also care about choosing similar actions as other agents in the organization or economy.16 12

Autor (2009) and Bolton and Dewatripont (2013) offer textbook presentations of the DS model as well

as a description of its main characteristics. 13 In a symmetric set-up, there are no asymmetric communication patterns in Calvo-Armagnol et al. : each agent is equally influential and there are no leaders. In contrast, we show how leadership and asymmetric information flows arise naturally in symmetric settings.The main difference is that Calvo-Armagenal et al. posit a communication technology with strong decreasing marginal returns, always enough to overwhelm the convexities induced by the coordination-adaptation trade-offs. Other differences are that agents are self-interested and invest in both active and passive communication. 14 See also Hagenback and Koessler 2010, and Galeotti et al. 2013, which study strategic communication in networks. 15 Also Dewatripont (2006) introduces incentives in DS by modeling communication as a moral hazard in teams problem, drawing upon Dewatripont and Tirole (2005). Again, implications for the optimal allocation of decision-making authority are derived. 16 Our model differs from such beauty-contest models (Morris and Shin 2002, Hellwig and Veldkamp 2009)

6

Closest to us, Bolton, Brunnermeir and Veldkamp (2013) highlight the benefits of leader resoluteness in achieving coordination.17 Similarly, Van den Steen (2005) shows how having leaders with strong beliefs may be desirable as they give direction to the firm by affecting the employee’s choice of project. The present paper does not speak to the characteristics of successful leaders. Instead, we show how and when leadership arises endogenously in a team of ex ante identical agents. Finally, the arguments for organizational focus introduced in the present paper are reminiscent of at least two other literatures in organizational economics. In multitask incentive theory (Holmstrom and Milgrom, 1991 and 1994), a narrow task-assignment may allow a principal to provide higher-powered incentives to an agent. In multitask career concerns models, Dewatripont et al. (1999) show how incentives are impaired by an agent pursuing multiple objectives. The key insight in this literature is that it is easier to provide incentives to specialized agents. The above theories thus offer rationales for specialization at the individual level but are silent on the issue of organizational focus, which is the topic of this paper. More related to this paper, the literature on ‘narrow business strategies’ and ‘vision’ (Rotemberg and Saloner 1994, 2000) has argued that the commitment by a principal or leader to select a certain type of projects provides strong incentives for agents to exert effort related to such projects. In the same vein, Rantakari (2013b) shows how a principal optimally favors one agent over another (competing) agent in order to improve both idea generation and strategic communication. As in the multitask models above, ‘focus’ is thus again a tool to improve incentives.

2

The model

We posit a team-theoretic model, based on Dessein and Santos (2006), Alonso et al. (2008) and Rantakari (2008), in which production requires the combination of n tasks, each carried out by a different agent. The implementation of a task is informed by the realization of a in that agents must adapt their actions to local shocks, an assumption which is more suitable to study information flows within organizations. 17 See also Dewan and Myatt (2008) and Morris and Shin (2010), who show how the ability of a leader to convey her information clearly to followers is often more important than the precision of her information.

7

task-specific shock, only observed by the agent in charge of that task. Organizational tradeoffs arise because agents need to adapt to this privately observed shock while maintaining coordination across different tasks. Communication allows agents to partially share their local information with other members of the organization, but requires the use of scarce organizational resources (e.g. time, attention). The paper studies the optimal allocation of scarce communication capacity or ‘attention’.

2.1

Production

Production involves the implementation of n tasks, each performed by one agent i ∈ N = {1, 2, ..., n}. The profits of the organization depend on (i) how well each task is adapted to its organizational environment and (ii) how well each task is coordinated with the other tasks. For this purpose, agent i must take a primary action, qii ∈ R, and a coordinating action, qij ∈ R, for each task j ∈ N \ {i}. In particular, agent i observes a piece of information θi , a shock with variance σθ2 and mean 0, which is relevant for the proper implementation of the assigned task. We refer to θi as the local information of agent i and assume its realization is independent across agents. Let qi = [qi1 , qi2 ..., qii , ..., qin ] be the actions taken by agent i. Given a particular realization of the local information, θ = [θ1 , θ2 , ..., θn ], and a choice of actions, q = [q1 , q2 , ..., qn ], the realized profit of the organization is: π (q|θ) =

X

πi (q|θ) ,

(1)

i∈N

where, for each i ∈ N , 



πi (q|θ) = − (qii − θi )2 + β

X

(qji − qii )2 

(2)

j∈N \{i}

The performance of task i, as measured by πi , then depends on how well adapted agent i0 s primary action qii is to task i0 s local information θi and how well coordinated agent j 0 s coordinating action qji is to agent i0 s primary action qii . The parameter β > 0 measures task-interdependence and the importance of coordination relative to adaptation. Communication frictions and adaptation-coordination trade-offs: If agent i could perfectly communicate his primary action to agent j, there is no trade-off between adaptation and 8

coordination: agent i then optimally sets qii = θi and agent j ensures coordination by setting qji = qii . In the presence of communication frictions, however, adaptation-coordination tradeoffs arise. Indeed, assume agent j receives no information about qii and therefore sets qji = E(θi ) = 0. Then the more agent i adapts qii to θi the larger are the coordination costs with task j. By ignoring his local information θi and setting qii = 0, however, agent i can ensure perfect coordination with task j. Thus expression (1) captures the notion, going back to at least March and Simon (1958), that it is adaptation to unpredictable contingencies, combined with communication frictions, that create coordination problems.18 An alternative specification: In Appendix B we study an alternative specification which imposes qii = qij = qi so that each agent chooses a single action, in line with Alonso et al. (2008) and Rantakari (2008). In this case, even with perfect communication, agents face a trade-off between adapting their action to local information and maintaining coordination with the actions of other agents. While this makes the analysis less transparent, as each action now optimally depends on all local shocks, Appendix B shows that qualitatively identical results obtain.

2.2

Attention as a scarce resource and communication frictions

To improve coordination between tasks, agents can communicate their local information.19 Such communication will often be imperfect, but the more time or attention an agent j devotes to understanding agent i0 s primary action qii the more likely it is he understands which corresponding coordinating action qji is required to ensure coordination.20 18

“(D)ifficulties arise only if program execution rests on contingencies that cannot be predicted perfectly

in advance. In this case, coordinating activity is required (...) to provide information to each subprogram unit about the relevant activities of the others.” (p. 180). March and Simon also emphasize the role of ‘complementary actions’ in achieving coordination: “To the extent that contingencies arise, not anticipated in the schedule, coordination requires communication to give notice of deviations from planned or predicted conditions, or to give instructions for changes in activity to adjust to these deviations.” (p182). 19 Whether what is communicated is the realization of the local information or the choice of the primary action is not key because, as we shall show, there is a one-to-one correspondence between them. 20 Note that in our model, agents have no incentive to hide information, as in Campbell et al. (2013) and Blanes i Vidal and Moller (2013), or exaggerate information, as in Alonso et al. (2008) and Rantakari

9

We assume that communication occurs in public meetings so that each agent j 6= i devotes the same amount of time or attention to any given agent i. An attention allocation t = [t1 , t2 , ..., tn ], where ti ≥ 0 for all i ∈ N , then represents the “air-time” or “attention” any agent i receives. Alternatively, ti is the meeting time devoted to discussing task ti , regardless of whom speaks. Organizational attention is scarce, which is captured by the following attention constraint: X

ti ≤ τ,

(3)

i∈N

where τ < ∞. For example, τ could be the length of time agents spend in meetings as opposed to production. The assumption that τ is exogenous simplifies the analysis but it is not crucial for the results.21 In Appendix E we consider alternative models of communication such as bilateral communication (e.g. agent 3 may devote more attention to agent 1 than agent 2 does) and individual attention constraints (so that the time agents 3 and 4 communicate does not affect the attention constraints of agents 1 and 2). We show that they result in attention allocations that are equivalent to the ones that are optimal under public communication. We adopt the same binary communication technology as in Dessein and Santos (2006). Each agent j receives a message mji concerning θi . With a probability 1 − r(ti ), mji is pure noise and the choice of the relevant coordinating action qji cannot be made contingent on it. In contrast, with a probability r(ti ) agent j perfectly understands agent i, that is mji = θi . We posit the following attention function: 1 − r(ti ) = (1 − p)ti ,

(4)

where ti can be interpreted as the number of signals an agent j 6= i receives about θi and where each of those signals correctly reveal θi with an independent probability p. If ti is continuous rather than discrete, communication technology (4) can be viewed as a Poisson process where λ = ln(1 − p)−1 is the constant hazard rate that the receiver correctly learns (2008). As is common in the team-theory literature, we focus on non-strategic (technological) limits to communication. 21 We have derived a number of results for when τ is endogenous which show that our findings are robust to P an endogenous communication capacity. Specifically, we allow the designer to choose t at a cost g( i∈N ti ), where g(.) is a convex function. The analysis is presented in Appendix F.

10

the local information of the sender and, hence, 1 − r(ti ) = e−λti . We further assume that agent i never knows whether his communication with agent j 6= i was successful or not. Thus, when deciding on a primary action qii he takes into account that with a probability r(ti ) agent j will be informed about his choice. Note that the value of communicating an additional signal about θi – in term of increasing the probability r(ti ) of successful communication – is decreasing in the number of signals ti that have already been communicated. Indeed, the more signals that have been sent about θi the more likely it is that an agent j 6= i has already learned the realization of θi . A natural feature of communication technology (4) is thus that it implies decreasing marginal returns to communicating about a particular task-specific shock. An alternative specification: From (4), agent j 0 s expected residual variance about θi equals: RV (ti ) ≡ E [V ar(θi |ti signals about θi )] = σθ2 (1 − r(ti )) = σθ2 e−λti

(5)

As we formally discuss in Appendix D, an identical residual variance as in (5) obtains if messages mji and local information θi are normally distributed and the attention constraint (3) is modeled as a constraint on the total reduction in entropy, as in Information Theory (Cover and Thomas 1991) and the literature on Rational Inattention (Sims 2003). Since expected profits for a given attention allocation t can be written as a function of the residual variance, identical results obtain with this alternative communication technology.

2.3

Timing of the game

The timing of our model goes as follows: 1. Organizational design: Optimal attention allocation t is chosen. 2. Local information: Agent i ∈ N observes local information θi . 3. Communication: Agents allocate attention according to t. 4. Adaptation and Coordination: Agent i ∈ N chooses primary action qii and complementary actions qij , for all j ∈ N \ {i}.

11

We model the resulting agents’ action in stage four as the equilibrium outcome of a Bayesian Game with common payoffs.22 Equivalently, one can think of the designer specifying optimal decision rules in the organizational design stage (see Garicano and Van Zandt 2013). The type of agent i, denoted by Ii , describes the realization of his local information θi and the information mij he has received about the local shock of each agent j 6= i.

3

Organizational Focus

We now first provide a characterization of the optimal allocation of attention when there are two agents and two tasks. This is enough to convey many intuitions of the model. Section 3.2 then generalizes the result for the case of n tasks.

3.1

Organizational focus with two agents

Without loss of generality, we focus on characterizing equilibria in linear strategies, where qii = aii θi

and

qji = E [qii |Ij ] = aii E [θj |Ii ] .

(6)

It is further useful to think of the organization designer choosing both decision rules qii = aii θi and communication qualities (r1 , r2 ), with 1 − ri = e−λti , rather than simply an allocation of attention (t1 , t2 ). The attention constraint t1 + t2 ≤ τ is then equivalent to (1 − r1 ) (1 − r2 ) ≥ e−λτ .

(7)

The communication constraint (7) will be binding at the optimum and characterizes by how much the communication quality about task i must be reduced in order to improve communication about task j 6= i. Note that the sum of communication qualities r1 + r2 is maximized by setting r1 = r2 . The organization design problem is then equivalent to the organization designer choosing (r1 , r2 , a11 , a22 ) in order to maximize the expected profits E [π (q|θ)] = −(1 − a11 )2 σθ2 − (1 − a22 )2 σθ2 − β(1 − r1 )a211 σθ2 − β(1 − r2 )a222 σθ2 , 22

(8)

More generally, for our analysis, it does not matter if agent i maximizes firm peformance π (q|θ) (common

pay-offs) as in Dessein and Santos (2006) or a mix of firm performance and task performance, that is (1 − ω)π (q|θ) + ωπi (q|θ) with ω ∈ (0, 1) , as in Alonso et al. (2008) and Rantakari (2008).

12

subject to constraint (7). Inspecting (8), it is immediate that aii and ri are complementary choices: the larger is aii , the larger should be ri and vice versa. Intuitively, the more adaptive is an agent’s task (the larger is aii ), the more important it is to communicate effectively regarding this task in order to ensure coordination. In return, when communication about a task is better, it is optimal for the agent-in-charge to become more adaptive to task-specific shocks, as the other agent is more likely to take the appropriate coordinating action. Indeed, optimizing a11 and a22 given r1 and r2 yields a11 =

1 1 + β(1 − r1 )

and

a22 =

1 . 1 + β(1 − r2 )

(9)

Hence, the larger is ri , the more adaptive is task i to its local shock θi . In the absence of decreasing marginal returns to attention, the above complementarity creates a powerful force towards organizational focus where all attention is devoted to one task. Substituting (9) and (7) into (8), and taking the derivative of expected profits with respect to r1 yields

where a11 and a22

dE [π(q|θ)] dr2 2 2 2 2 = βa11 σθ + βa22 σθ , (10) dr1 dr1 are given by (9). As we discuss next, for r1 > r2 , the sign of (10) depends

on how important is coordination (β) and how scarce is attention (τ ). As a benchmark, consider the case where dr2 /dr1 = −1, so that any increase in the effectiveness of communication about task 1 is offset by an equal decrease in communication quality about task 2. This would have been the case, for example, if the communication constraint were given by r1 + r2 ≤ τ rather than (7). Since a11 > a22 whenever r1 > r2 , we have that dE [π (θ)] /dr1 > 0 whenever r1 > r2 . It follows that the organization designer chooses either (r1 , r2 ) = (r(τ ), 0) or (r1 , r2 ) = (0, r(τ )) as long as r(τ ) < 1. In other words, with constant returns to attention (as reflected in dr2 /dr1 = −1) , it is optimal to only communicate about one task (say task 1) and let this task be very adaptive (a11 large). Coordination about the other task (say task 2) about which there is no communication (r2 = 0), is then achieved by restricting its adaptiveness (setting a22 low). Given constraint (7), however, we have that for r1 > r2 : dr2 1 − r2 =− < −1. dr1 1 − r1 13

(11)

Whenever r1 > r2 , any increase in the effectiveness of communication about task 1 is then offset by a more than equal decrease in the communication quality about task 2. Constraint (7) therefore reflects decreasing marginal returns to attention, providing a counter-veiling force against organizational focus. We distinguish between high and low task-interdependence. When task-interdependence is low, that is β < 1, one can verify that (10) is negative whenever r1 > r2 . The organization designer then optimally sets r1 = r2 and a11 = a22 . Proposition 1 If β < 1 then organizational balance is optimal: (t∗1 , t∗2 ) = ( τ2 , τ2 ). Proposition 1 states that if coordination is not much of a concern, then it is always optimal to split attention evenly among both task. Intuitively, given the limited need for coordination, the adaptiveness of any given task is only marginally affected by improvements in communication quality. The fundamental complementarity at the heart of our model – where a higher communication quality results in more adaptive tasks, making it optimal to further improve communication – is then relatively weak. In contrast, decreasing marginal return to attention (as reflected in dr2 /dr1 ) are independent of β. For β sufficiently small, the organization designer then optimally sets r1 = r2 . Whenever β > 1, however, complementarities induced by the need for coordination may result in organizational focus. Consider the maximal communication quality r1 = r(τ ) that can be achieved by allocating all attention to task 1. From (10), even when there is no communication about task 2 and r1 = r(τ ), we have that dE(π)/dr1 > 0 provided r(τ ) is small enough. Hence, when attention is sufficiently scarce (τ small) organizational focus is locally optimal. In contrast, when attention is not scarce (τ is large), then (r1 , r2 ) = (r(τ ), 0) cannot be optimal as −dr2 /dr1 in (10) grows without bound. The following proposition characterizes the optimal allocation of attention for β > 1 : Proposition 2 Suppose β > 1. There exists > (β) such that: (i) Organizational focus is optimal, t∗1 ∈ {0, τ } and t∗2 = τ − t∗1 , if and only if τ ≤ > (β) (ii) Organizational balance is optimal, (t∗1 , t∗2 ) = ( τ2 , τ2 ), if τ > > (β) (iii) >(β) is increasing in the importance of coordination, β. 14

Figure 1 summarizes Proposition 1 and Proposition 2. Organizations which are “ somewhat” focused are never optimal. Indeed, if full focus is not optimal, the organization divides its attention equally among both tasks. Intuitively, given the complementarities between the adaptiveness of a task and the attention devoted to a task, the organization either completely ignores a task, or it devotes a substantial amount of attention to it. At the threshold > (β) , the organization makes this shift from no attention to one task, to an equal amount of attention to both tasks. Ex-ante and ex-post coordination. A useful way to understand the above results is through the notion that there are two ways to maintain coordination in an organization. One way is for the organization to devote substantial attention to a task (setting ri high). The agent in charge of this task can then be very responsive to his local information as the other agents in the organization will likely be aware of his actions, by means of communication, and take the appropriate coordinating actions. In Dessein and Santos (2006), this was referred to as expost coordination. An alternative way is for the agent to simply ignore his private information and always implement his task in the same manner (setting aii low). Other agents can then maintain coordination with this task without having to devote any attention to it. This can be seen as ex-ante coordination. The notions of ex-ante and ex-post coordination correspond to the two general ways in which organizations can be coordinated according to March and Simon (1958): coordination by plan, which requires that tasks are executed in a more or less standardized way (e.g. following standard operating procedures), and coordination by feedback, which involves the transmission of new information. As March and Simon note ”it is possible to reduce the volume of communication required from day-to-day by substituting coordination by plan for coordination by feedback” (p.183). While in Dessein and Santos (2006) all tasks were treated symmetrically by assumption, the insight of Proposition 2 is that when attention is scarce (that is τ < > (β)), it is optimal to coordinate ex-ante on one of the tasks and coordinate ex-post on the other task. The first task is then very rigid and insensitive to its local information, so that the organization can afford to ignore this task and fully allocate its attention to the second task, allowing it to be flexible and adaptive. Despite a limited attention capacity, both tasks are then well coordinated, but only one task is very sensitive to its environment. In contrast, when 15

Figure 1: Focused and balanced organizations in the two-agent case. τ 6

> (β) Balanced organizations t = [ τ2 , τ2 ]

Focused organizations t = [τ, 0] or t = [0, τ ]

-

β

βb attention is plentiful, it is optimal for both tasks to be very adaptive, as they both can be coordinated ex-post through communication.23 Comparative statics with respect to communication technology. Proposition 2 yields an interesting comparative static result with respect to exogenous changes in the communication capacity τ. Improvements in the communication technology (E-mail, wireless communication devices, intranets, ...) can be interpreted as an exogenous increase in τ . An implication of Proposition 2, therefore, is that such technological improvements result in a shift from focused organizations which are centered around one task and excel on that task at the expense of others, towards more balanced organizations which aim to perform equally well on all tasks, but excel in none. The above insight also has implications for the importance of leadership in teams. At 23

An alternative explanation for rigid, bureaucratic decision-making is provided by the influence-cost

literature (Milgrom and Roberts 1988). See Powell (2013) for a model in which the organization designer reduces the adaptiveness of decisions to new information in order to limit influence activities.

16

the threshold > (β) the organization changes from having a single agent who monopolizes all information flows (the leader) to a structure with shared leadership. Hence, an increased communication capacity may come at the expense of the original leader in an organization, who may face a discrete loss of power and influence in the organization. As a result, his task is less adapted to its environment and, typically, other tasks are less well coordinated with it. From having a complete monopoly on attention in the organization, this leader now must share it equally with the other agent engaged in team production. As discussed in the introduction, the above predictions are consistent with new trends in organizational design towards more network-like organizations where communication flows are horizontal rather than vertical, and decision-making and influence is broadly shared in the organization, as documented in both case studies (for example “Proctor & Gamble Organization 2005,” HBS case 9-707-519) and large scale empirical studies (Whittington et al. 1999, Guadalupe et al. 2012).24 Task asymmetries and strategic direction. In our model, tasks are ex ante symmetric and it does not matter which task the organization focuses on. In reality, of course, tasks are likely to differ from each other. The question, then, is not only how focused to be, but which tasks to focus on. A first asymmetry concerns the expected magnitude of the shocks faced by each task. If shock θi has a variance σi2 with σ12 > σ22 , then it is easy to show that whenever τ ≤ > (β) , the organization optimally focusses on task 1, that is t = (τ, 0). Therefore, the result that the organization is indifferent as to which task to focus on is knife-edge. As there may be uncertainty about the variance of each task-specific shock, this creates a role for management in setting the “strategy” of the firm by identifying which task the organization should focus on. As we discuss in Section 5, in a dynamic version of our model, the focus of an organization may be difficult to change over time because of complementary organization design choices such as hiring decisions and investments in human capital. Firms which focus on the “right” task may then outperform their competitors over time. Firms who refuse to make a choice and split attention, however, are likely to be worst off. A second asymmetry is one where one task imposes larger coordination costs (delays, low product quality) should the other task not take the appropriate coordinating action. For 24

See also Roberts and Saloner (2013), Section 3.5.

17

example, in designing a car, important changes made to how the engine works, may have important consequences for the remainder of the design. Should attention be focused on those highly interdependent tasks? Appendix C extends the two-task model to allow tasks to have different degrees of interdependence. We show that, perhaps counter-intuitively, when some tasks are more interdependent than others and attention is relatively scarce, it is optimal not to focus attention on highly interdependent tasks, but instead restrict their adaptiveness.

3.2

Organizational focus with many agents

We now extend our analysis to allow for an arbitrary number of agents in the team. We first show that the optimal organizational form is the `−leader organization, which features a number ` ≤ n of equally adaptive agents (leaders) to whom the organization devotes an equal amount of attention, whereas no attention is devoted to any agent who is not a leader. We then provide comparative statics to highlight how the optimal organizational form adapts to change in communication capacity, task-interdependence and the complexity of the production process. Definition: The `−leader organization. In an `−leader organization, the set of agents can be partitioned in a set of leaders L (t) and followers F (t) such that 1. The number of leaders is ` ≤ n. 2. For each follower i ∈ F (t), ti = 0. 3. For each leader j ∈ L (t), tj = τ` . An `−leader organization has the property that there is a number of agents `, which we call leaders, to whom all agents (including other leaders) pay equal attention, and a second class of agents to whom no other agent in the organization pays attention. Proposition 3 The optimal organization is an `−leader organization with ` ∈ {1, 2, · · · , n}. The proof of Proposition 3 follows from the observation that in an optimal organization, all agents who receive some positive attention must receive the same attention. To see 18

the intuition behind this result, let i and j be two tasks and assume that ti > tj > 0, in violation of the claim. In the case of two tasks, it was shown that either t∗1 ∈ {0, τ } , or t∗1 = t∗2 = τ /2. Following the same logic, one can equally show that, keeping the attention allocated to all other tasks k ∈ / {i, j} fixed, profits can always be strictly increased by either setting t0i = ti + tj and t0j = 0 or, alternatively, equalizing attention across tasks i and j, that is setting ti = tj = (ti + tj )/2. As in the two tasks case, it is optimal to either allocate a substantial amount of attention to any given task, allowing it to become very adaptive and coordinate this task ex post, or force a task to largely ignore its local information and coordinate this task with others ex ante, which does not require any attention. When the communication network takes the form of an `−leader organization, the expected profit function can be written as: E [π (q|θ)] = −` (1 − al )2 + (1 − rl )a2l β(n − 1) σθ2 − (n − `) (1 − af )2 + a2f β(n − 1) σθ2

(12)

where 1 − rl = e−λτ /` , and al =

1 1 + (1 − rl )β(n − 1)

and

af =

1 1 + β(n − 1)

The optimal number of leaders, then, is given by `∗ = argmax`∈{1,2,··· ,n} E [π (θ)] .

(13)

Armed with (13) we are able to offer a sharp characterization of the `−leader organization as a function of the organization’s communication capacity τ and the task-interdependence or coordination parameter β. Proposition 4 There exists 0 < β (n) < ... < β (` + 1) < β (`) < ...< β (2) such that 1. `∗ is decreasing in β, i.e., `∗ = n if β < β (n) , `∗ = ` ∈ {2, · · · , n − 1} if β ∈ (β (` + 1) , β (`)), and `∗ = 1 if β > β (2) 2. For all ` ∈ {1, ..., n}, β (`) is increasing in τ and limτ →∞ β (`) = ∞.

19

Figure 2: Optimal number of leaders and adaptation as a function of β Example: n = 20, σθ2 = 1, τ = 50/19, and β ∈ [0, 1]. Panel A: Optimal number of leaders, `∗ , as a function of the importance of coordination β. Panel B: Leader adaptation Ωl ≡ al .

Panel B: Leader adaptation 1

18

0.95

16

0.9

14

0.85

12

0.8

Ωl

l*

Panel A: Optimal number of leaders 20

10

0.75

8

0.7

6

0.65

4

0.6

2

0.55

0

0

0.2

0.4

0.6

0.8

0.5

1

β

0

0.2

0.4

0.6

0.8

1

β

Figure 2 illustrates the results of Proposition 4 for a specific numerical example (n = 20, σθ2 = 1 and τ = 50). Start with Panel A, which plots the optimal number of leaders `∗ as a function of the importance of coordination, β ∈ (0, 1). A balanced organization is optimal when coordination is sufficiently unimportant. In this specific example, whenever β < β (20) ≈ 0.06 the organization is fully balanced, that is, `∗ = n = 20. As coordination becomes more important, the communication becomes more focused around fewer leaders. Finally, when tasks are sufficiently interdependent, when β > β (1) ≈ 0.73, the organization has a single leader, `∗ = 1. Panel B of Figure 2 shows how al , the adaptiveness of each leader j ∈ L (t) to his local shock, changes as tasks become more interdependent as measured by β. Interestingly, leaders tend to be much more adaptive when coordination costs are higher, as they then share influence with fewer other leaders. For example, when β = 0.8, there is only one leader, but this leader is, roughly, 50% more adaptive to his local information then when

20

β = 0.2 and the number of leaders equals `∗ = 3. Proposition 4 also shows how an exogenous change in attention capacity τ, increases the number of leaders and makes the organization more balanced. Again, this implies that as communication technology improves, organizations become less focused and leadership is more broadly shared. We conclude this section by providing a result on how the optimal number of leaders depends on the complexity (or organizational size) of the production process, as captured by the number n of complementary tasks. This result will also be useful when analyzing endogenous organizational size (Section 4). Let `∗ (n) be the optimal number of leaders given that the production requires the implementation of n tasks. Proposition 5 Larger organizations are more focussed than smaller organizations: if `∗ (n+ 1) < n + 1, then `∗ (n + 1) ≤ `∗ (n). As an organization grows larger and its production becomes more complex, coordinationadaptation trade-offs become more pronounced, forcing the organization to become more focused. Despite the organization having more members, fewer of them receive attention. This result is consistent with the life cycle theory of organizations (Greiner 1972) according to which small entrepreneurial firms, as they grow bigger, move from an initial phase of creativity, where almost all employees are involved in decision-making to a phase of direction, where “able, directive leadership” is installed, and where “[t]he new manager and his or her key supervisors assume most of the responsibility for instituting direction.” Consistent with our model, Greiner also describes how firms become less adaptive and responsive to their environment as they grow larger.

4

Organizational Size

One interpretation of our model is that each task corresponds to a different type of product or service that is produced or delivered by a multi-product firm. Under this interpretation, it is natural for the organization designer to optimally chose the size of the organization. By engaging in multiple tasks, firms can spread out fixed costs and realize scope economies 21

(Panzar and Willig, 1981). Doing so, however, increases coordination costs as now more tasks need to be coordinated. We therefore modify our model to include fixed production costs, such as production facilities or a distribution network. We investigate how the optimal size (or scope) of an organization interacts with organizational focus, and how organizational size and focus are jointly optimized in response to more volatile environments and improvements in communication technology.25

4.1

A model of endogenous organizational size

We posit that the realized profit of an organization which performs n tasks (or produces n products) equals π(n) = nP − F + π(q|θ),

(14)

where P > 0 represents the revenues that can be obtained per task, F > 0 are the fixed costs which are shared by all tasks/products, and the last term in (14) is the profits of the n tasks model (expression 1). We further assume that regardless of the size of the organization, there is a fixed communication capacity τ which can be spent communicating about the n product-lines. Intuitively, as the organization grows, meetings become larger as more agents need to be coordinated. But each agent can still only spend time τ in such a meeting, hence the length of the public meeting remains τ . As shown in Proposition 3, for a given organizational size n, the optimal organization is a `−leader organization. If `∗ (n) is the optimal number of leaders given n tasks, then the expected profits of an organization of size n are given by E [π(n)] = nP − F − nσθ2 + `∗ (n) + (n − `∗ (n))

σθ2 1 + (n − 1)βe−2τ /`∗ (n)

σθ2 . 1 + (n − 1)β

We assume that organizational size is chosen to maximize profits per product-line: 1 n∗ = arg max E[π(n)]. n n 25

We endogenize organizational size, not firm size. See Friebel and Raith (2010) and Dessein (2014) (and

references therein) for models of multi-product firms with endogenous firm boundaries.

22

Our underlying assumption is that firms, whenever profitable, have the option to operate a set of product lines independently as a separate organization.26 The following proposition shows the optimal organizational response to an increase in the volatility of the environment σθ2 . Proposition 6 The optimal organization size n∗ is decreasing in σθ2 . Furthermore, organizational balance, `∗ (n∗ )/n∗ , is increasing in σθ2 . If `∗ (n∗ ) < n∗ , the number of leaders is increasing in σθ2 . Management scholars have cited many reasons for the rise of new organizational forms, but one line of explanation which is especially prominent is the “increased turbulence” that managers face because of rapid technological changes, deregulation, and globalization (Rivkin and Siggelkow, 2005; Roberts and Saloner, 2013).27 As σθ2 increases, so do the incentives to adapt, which in turn bring coordination costs. By narrowing firm scope (reducing n∗ ), organizations partially reduce these coordination costs, allowing for a better adaptation. Put differently, organizations trade-off economics of scale and scope with adaptation to a changing environment. Proposition 6 therefore reflects the common idea that smaller organizations are more “nimble” and “flexible.” Since for a given organizational size n, the number of leaders `∗ is independent of σθ2 , and, from Propositions 5, `∗ weakly decreases with n, we obtain that a more volatile environment not only results in smaller but also more balanced organizations.2829

4.2

Information technology and organizational size

Given that limited attention and coordination costs restrict organizational size in our model, one may (naively) expect organizational size to be increasing in the communication capacity 26

Organizational boundaries are then based on who communicates with whom: Agents belong to the same

organization if they communicate with each other. 27 The other prominent line of explanation, information technology, is addressed in the next section. 28 Comparative statics with respect to an increase in the shareable fixed costs F are less interesting: n∗ is increasing in F , `∗ (n∗ )/n∗ , is decreasing in F , and if `∗ (n∗ ) < n∗ , the number of leaders is decreasing in F. 29 Rantakari (2013a) obtains a related result in a different setting. He shows how firms operating in more volatile environments decentralize decision-making and reduce task-interdependence, whereas in our model, firms become more balanced and reduce firm scope.

23

τ, and to be decreasing in the interdependence of tasks, β. What truly restricts organizational size, however, is the adaptiveness of the organization. In our model, there are no constraints on organizational size as long as agents do not adapt tasks to local shocks. When communication capacity is limited, organizations may therefore pursue two distinctive organizational strategies: (1) Adaptation: (Largely) give up on scope economies, and have a small, but adaptive organization, or (2) Economies of Scope: (Largely) give up on adapting to local shocks, and instead leverage economies of scale and scope. When scope economies (F ) are large, but communication capacity (τ ) is very limited, organizations optimally choose to minimize average fixed costs at the expense of adaptation to local shocks. Organization-wise, this strategy consists of having a large, rigid organization, and focussing all attention on one or a few leaders. Large, non-adaptive organizations with one or a few leaders are then optimal. As τ becomes larger, however, the organization may gradually want to use the extra communication capacity to become more adaptive. Doing so without incurring substantial coordination costs, however, requires reducing organizational size, often substantially. At the same time, a smaller size allows the organization to pay attention to a larger number of tasks or leaders. It is only when the communication capacity is sufficiently large that organizations can pursue both objectives, scale economies and adaptation, in which case organizational size is increasing again in τ . Figures 3 illustrates the above organizational strategies in response to changes in τ . For simplicity, it is assumed that n∗ is constrained to n∗ ≤ n = 18. For τ very small, the organization size is set at the maximum, n∗ = n. As communication capacity τ increases, the organization is transformed from a “large, rigid bureaucracy” with eighteen tasks but only one leader who monopolizes all attention into a “nimble, adaptive democracy” with only six tasks who share the attention evenly. For larger values of τ, organizational size slowly increases again with τ and attention remains evenly distributed. To the extent improvements in information and communication technology (ICT) correspond to an increase in the communication capacity, our model predicts that improvements in ICT may result in a shift from large inflexible organizations emphasizing economies of scale and scope, towards smaller, more balanced organizations, which are focused on be-

24

Figure 3: Endogenous organizational scope and focus as a function of τ Example: maximum number of tasks n = 18, σθ2 = 1, β = .25 and F = 3. Optimal organizational size, n∗ (continuous line), and of leaders, `∗ (dashed line), as a function of τ .

18

16

14

12

l* n*

10

8

6

4

2

0 0

1

2

3

4

5 τ

6

7

8

9

10

ing adaptive to external shocks and emphasize horizontal communication linkages.30 This is consistent with recent trends in organization design, as described by Whittington et al. (1999) and Roberts and Saloner (2013). According to our model, only organizations that are already very adaptive, respond to ICT improvements by increasing organizational scope. Alternatively, observed trends toward de-sizing and de-scoping may have been a response to an increased variability in the environment (Proposition 6), for example because of globalization and increased competition (Rivkin and Siggelkow, 2005; Roberts and Saloner, 2013). While we have emphasized non-monotone comparative statics with respect to τ, similar intuitions apply when comparative statics with respect to β are considered. When tasks are sufficiently interdependent (β large), organizations may give up on adaptation and maximize scale and scope economies by increasing organizational size. 30

This prediction stands in contrast with those of obtained in recent team-theory models that model or-

ganizations as information-processing (Bolton and Dewatripont 1994) or problem-solving institutions (Garicano, 2000; Garicano and Rossi-Hansberg, 2006). While these papers also characterize optimal information flows in organizations, improvements in communication technology unambiguously result in larger and more centralized organizations.

25

5

Concluding remarks

In this paper, we have studied the optimal allocation of scarce attention in organizations that face competing objectives of coordination, adaptation and, in the latter part of our paper, economies of scale and scope. We believe the proposed framework sheds light not only on the importance of organizational focus, but also on complex and often contradictory trends in organizational design, often attributed to improvements in information technology and an increasingly volatile business environment. To conclude, we expand on three elements which are absent from our model: skill-heterogeneity among employees, conflicts of interests and coordination through hierarchies. Skill-heterogeneity, functional cultures and the permanence of leadership. In order to isolate the role of leadership and focus in organizations, we considered a model in which all tasks and agents are ex ante symmetric. Indeed, when attention is not scarce, all tasks are treated symmetrically and the organization is a “team of leaders.” It is informational constraints, not technological constraints, that induce the need for leadership and organizational focus.31 Therefore, which tasks the organization focusses on, or who becomes a leader, is irrelevant in our model. The importance of organizational focus, however, does have implications for what type of employees an organization may want to hire. It may therefore cause asymmetries in the skill-level of employees. Higher-skilled employees may understand better how to adapt to a changing environment. If higher-skilled employees command higher wages, organizations that are focused on, say, engineering, will then optimally recruit higher-skilled engineers than organizations which are focussed on, say, marketing and sales. If communication will mainly pertain to coordinating engineering initiatives, it may further be optimal to hire employees with an engineering background even for marketing positions in order to foster communication. Over time “functional cultures” may develop, where certain parts of the organization become “backwaters,” whereas other parts dominate not only decision-making but also the talent pool of the organization. Technology companies are often reputed for their ”functional cultures”. Google, for example, proclaims “[it] is and 31

Appendix B shows how our model can accommodate technological trade-offs between adaptation and

coordination, and yield similar insights.

26

always will be an engineering company,” and almost all employees have advanced engineering degrees even in non-engineering tasks. Walter Isaacson’s biography of Steve Jobs, on the other hand, paints a picture of Apple as a company run by designers, where engineers were very often ignored. To the extent that employee turn-over is limited and hiring is costly and incremental, an initial focus on a particular set of tasks may therefore have long-term consequences. While our model is static, this suggests it may be neither feasible nor desirable for an organization to “rotate” its focus or leadership. Conflict, power struggles and organizational change. Incentives play no role in our theory, but if agents care mainly about the performance of “their task” (as modeled, for example, in Alonso et al. 2008), power struggles and conflict may arise as to whom becomes the focus of the organization. A dynamic version of our model where the allocation of attention is decentralized, rather than optimally allocated by an organization designer, may investigate how agents act strategically to become the center of attention. To the extent that leadership is an endogenous and self-enforcing phenomenon, organizations may get stuck in inefficient equilibria. Indeed, agents pay attention to whomever they believe is very adaptive, and agents are adaptive only when they are the the center of attention. Hence, multiple leadership equilibria may exist. Leadership may further become entrenched, with current leaders (agents who are currently the focus of the organization) being unwilling to give up their central position in the network. Our model further suggests that an increase in communication capacity can be “traumatic” for the existing leadership of the organization. As communication capacity increases, overall adaptiveness increases, but so can be the number of leaders. Given that attention is shared equally among the leaders of the organization, this often implies that each individual leader receives less attention. This effect can be particularly pronounced when there is a move away from a one-leader organization to a setting with multiple leaders. Coordination and Hierarchies. In our model, production is carried out by a team of agents and decision-making is de facto decentralized. Informal leadership then arises endogenously to ensure coordination. In practice, organizations often achieve coordination by

27

centralizing decisions and/or introducing layers of hierarchy. In the context of our model, the organization could centralize production in the hands of a headquarter manager, who takes all production decisions after communicating with agents. Centralized production has the virtue of ensuring perfectly coordinated decisions. Centralized organizations, however, are likely to be less adaptive to shocks as they must rely on (limited) communication with agents. In ongoing work, we show that whenever attention is scarce, decentralized team production is then preferred over centralized production. Hence, centralized production is unlikely to be optimal in fast-moving environments where τ is small. As discussed above, empirical evidence indicates that many organizations have gone through a process of delayering in recent decades, suggesting they have come to rely less on hierarchies for coordination. To shed light on this trend, it would be interesting to introduce multi-layered hierarchies in our model, where such layers provide an alternative way of coordinating production.

28

References Alonso, Ricardo, Wouter Dessein and Niko Matouschek (2008). “When does Coordination Require Centralization?” American Economic Review, 98 (1), 145-179 Alonso, Ricardo, Wouter Dessein and Niko Matouschek (2013). “Organizing to Adapt and Compete.” American Economic Journal: Microeconomics. Forthcoming. Arrow, Kenneth (1974). The Limits of Organization. W. W. Norton & Company, New York. Autor, David (2012). “Market Structure, Organizational Structure and Wage Structure.” Labor Economics II Lecture Notes, MIT, available at http://economics.mit.edu/faculty/dautor/courses. Blanes i Vidal, Jordi and Marc Moller (2013). “Decision-making and implementation in teams.” Mimeo, LSE. Bolton, Patrick, Markus Brunnermeier and Laura Veldkamp (2013). “Leadership, Coordination and Corporate Culture.” Review of Economic Studies, 80(2), 512-537. Bolton, Patrick and Mathias Dewatripont (1994). “The Firm as a Communication Network.” Quarterly Journal of Economics, 109, 809-839. Bolton, Patrick, and Mathias Dewatripont (2013). ”Authority in organizations.” Handbook of Organizational Economics, 342-372, eds. R. Gibbons and J. Roberts. Princeton University Press. Calvo-Armengol, Antonio, Joan de Marti and Andrea Prat (2011). “Communicaton and Influence.” Mimeo, LSE. Campbell, Arthur, Florian Ederer and Johannes Spinnewijn (2014). “Freeriding and information revelation in partnerships.” American Economic Journal: Microeconomics. 6(2): 163-204. Cover, Thomas M., and Joy A. Thomas (1991). Elements of information theory. John Wiley & Sons, New York. Cremer, Jacques (1980). “A Partial Theory of the Optimal Organization of a Bureaucracy.” Bell Journal of Economics, 11(2), 683-693.

29

Cremer, Jacques (1993). “Corporate Culture and Shared Knowledge.” Industrial and Corporate Change, 2 (3), 351-386. Cremer, Jacques, Luis Garicano, and Andrea Prat (2007). ”Language and the Theory of the Firm.” Quarterly Journal of Economics 122(1), 373-407. Dessein, Wouter and Tano Santos (2006). “Adaptive Organizations.” Journal of Political Economy, 114 (50), 956-995. Dessein, Wouter (2014). “Incomplete Contracts and Firm Boundaries: New Directions.” Journal of Law, Economics and Organization, 30(S1), i13-i36. Dewan, Torun and David Myatt (2008). “The Qualities of Leadership: Direction, Communication, and Obfuscation.” American Political Science Review. 102 (3), 351-368. Dewatripont, Mathias (2006). ”Costly communication and incentives.” Journal of the European Economic Association. 4 (2-3), 253-268. Dewatripont, Mathias, Ian Jewitt, Ian and Jean Tirole (1999). “The Economics of Career Concerns, Part II: Application to Missions and Accountability of Government Agencies.” The Review of Economic Studies, 66 (1), 199-217. Dewatripont, Mathias, and Jean Tirole (2005), “Modes of Communication.” Journal of Political Economy, 113(6) 1217-1238. Friebel, Guido and Michael Raith (2010), “Resource Allocation and Organizational Form.” American Economic Journal: Microeconomics, 2(2), 1-33 Galeotti, Andrea, Christian Ghiglino and Francesco Squintani (2013). “Strategic Information Transmission in Networks.” Journal of Economic Theory, 148(5), 1751-1769. Garicano, Luis (2000). “Hierarchies and the Organization of Knowledge in Production.” Journal of Political Economy, 108, 874-904. Garicano, Luis, and Esteban Rossi-Hansberg (2006). “Organization and Inequality in a Knowledge Economy.” Quarterly Journal of Economics 121(4): 1383-1435.

30

Garicano, Luis and Timothy Van Zandt (2013). “Hierarchies and the Division of Labor.” Handbook of Organizational Economics, 604-654, eds. R. Gibbons and J. Roberts. Princeton University Press. Geanakoplos, John, and Paul Milgrom (1991). “A Theory of Hierarchies Based on Limited Managerial Attention.” Journal of the Japanese and International Economies, 5(3), 205–225. Greiner, Larry E. [1972](1998) “Evolution and revolution as organizations grow.” Harvard Business Review 76(3): 55-68. Guadalupe, Maria, Hongyi Li, and Julie Wulf (2012). “Who Lives in the C-Suite? Organizational Structure and the Division of Labor in Top Management.” NBER working paper. Hagenbach, Jeanne and Frederic Koessler (2010).“Strategic Communication Networks.” Review of Economic Studies, 77(3), 1072-1099. Hart, Oliver and John Moore (2005)“On the design of hierarchies: Coordination versus specialization.” Journal of Political. Economy 113(4): 675- 702 Hellwig, Christian and Laura Veldkamp (2009) “Knowing What Others Know: Coordination Motives in Information Acquisition.” Review of Economic Studies, 76(1), 223-251. Holmstrom, Bengt, and Paul Milgrom (1991) “Multitask principal-agent analysis: incentive contracts, asset ownership, and job design.” Journal of Law, Economics and Organization, 7, 24-52. Holmstrom, Bengt and Paul Milgrom (1994)“The firm as an incentive system.” American Economic Review, 84, 972-991. March, James G., and Herbert A. Simon (1958). Organizations. Wiley (USA). Marschak, Jacob and R. Radner (1972). Economic Theory of Teams. Yale University Press. Milgrom, Paul and John Roberts (1988). “An Economic Approach to In‡uence Activities in Organizations,” American Journal of Sociology, 94(1), 154–179.

31

Morris Stephen, and Hyun Song Shin (2002). “Social Value of Public Information,” American Economic Review, 92(5), 1521-1534. Morris Stephen, and Hyun Song Shin (2010). “Optimal Communication,” Journal of the European Economic Association, Vol 5 (2-3), 594-602. Prahalad, C.K. and Gary Hamel (1990). “The core competence of the corporation,” Harvard Business Review, 68, 79-91. Panzar, John C., and Robert D. Willig (1981). ”Economies of scope.” American Economic Review, 268-272. Prat, Andrea (2002). “Should a Team be Homogenous?” European Economic Review, 46(7), 1187–1207. Porter, Michael (1985). Competitive Advantage: Creating and Sustaining Superior Performance. New York: Free Press Porter, Michael (1996). “What is Strategy?” Harvard Business Review, 74, 61-78. Powell, Michael (2013). “An Influence-Cost Model of Organizational Practices and Firm Boundaries.” Mimeo, Northwestern University. Radner, Roy (1993). ”The organization of decentralized information processing.” Econometrica, 61(5), 1109-1146. Rantakari, Heikki (2008). “Governing Adaptation.” Review of Economic Studies, 75, 1257-1285. Rantakari, Heikki (2013a), ”Organizational Design and Environmental Volatility.” Journal of Law, Economics, and Organization, 29(3), 569-607. Rantakari, Heikki (2013b), “Conflict Resolution and Organizational Influence.” Mimeo, University of Southern California.. Roberts John and Garth Saloner (2013). “Strategy and Organization,” Handbook of Organizational Economics, eds. Robert Gibbons and John Roberts, Princeton University Press, 799-849.

32

Rotemberg, Julio, and Garth Saloner (1994). “Benefits of Narrow Business Strategies,” American Economic Review, 84 (5), 1330-1349. Rotemberg, Julio, and Garth Saloner (2000). “Visionaries, Managers, and Strategic Direction,” RAND Journal of Economics,31 (4), 693-716. Simon, Herbert (1971).“Designing Organizations for an Information-Rich World,” Computers, Communication, and the Public Interest, ed Martin Greenberger, Baltimore, MD: The Johns Hopkins Press. Sims, Christopher (2003). “Implications of rational inattention,” Journal of Monetary Economics. 50 (3), 665-690. Siggelkow, N., and Rivkin, J. W. (2005). “Speed and search: Designing organizations for turbulence and complexity.” Organization Science, 16(2): 101-122. Van den Steen, Eric (2005). “Organizational Beliefs and Managerial Vision,” The Journal of Law, Economics, and Organization, 21 (1), 256-283. Van den Steen, Eric (2013). “A Formal Theory of Strategy.” Mimeo, Harvard Business School. Wernerfelt, Birger (2004). “Organizational Languages.” Journal of Economics & Management Strategy, 13: 461–472. Whittington, Richard, Andrew Pettigrew, Simon Peck, Evelyn Fenton, and Martin Conyon (1999). “Change and Complementarities in the New Competitive Landscape: A European Panel Study, 1992-1996.” Organization Science, 10(5), 583-600.

33

APPENDIX Appendix A: Proofs of the propositions Proof of Proposition 1 and 2. Substituting (9) into (8) we can write expected pro…ts as 2 X : (qj ) = n 2 + 1 + (1 r(ti )) i=1;2 Let t1 = t and t2 =

(15)

t; we consider, without loss of generality, that t 2 [ =2; ]. Taking the derivative of

(15) with respect to t we obtain e ( [1 + e

e t [1 + e t ]2

@E [ (qj )] = @t

t) (

:

t) ]2

(16)

After some plain algebra it follows that @E [ (qj )] > 0 () 1 @t Note that if Consider then 1

2

2

1 then 1

e

> 0 for all

e

< 0:

e

0; hence, optimality implies that t = =2. 2

> 1; de…ne T( ) so that 1

2

e

T( )

= 0. Note that T( ) is increasing in . If 2

> T( ) then 1

< 0 and therefore optimality implies that t 2 f0; g. If

e

< T( ) > 0 and

therefore optimality implies that t = =2. This completes the proof of Proposition 1 and 2. Proof of Proposition 3. Proposition 3 follows as a consequence of the following Lemma 7.

Lemma 7 In an optimal organization if ti > 0 and tj > 0 then ti = tj . Proof of Lemma 7. Using the same method as in Section 3.1, we obtain that expected pro…ts equal E [ (qj )] =

n X

(1

aii )2

2

+ (n

1)(1

r(ti ))a2ii

2

;

i=1

where aii = 1=(1 + (n

r(ti )): Substituting aii and re-arranging terms, we obtain n h i X 2 E( (q; tj )) = n 2 + 1+ (n 1)(1 r(ti ) :

1)(1

i=1

Suppose, for a contradiction, that ti > tj > 0. Consider now two alternative organizations. One organization, denoted by t0 , is the same as organization t, but t0i = ti

and t0j = tj + . The second organization,

denoted by ^t, is the same as organization t, but t^i = ti + and t^j = tj

. These constructions are derived

for some small and positive . Since the three organizations only di¤er in the way attention is distributed for task i and task j, each other task l 6= i; j performs equally across the three organizations. We can write E [ (q; tj )]

= C+

2

E [ (q; t0 j )]

= C+

2

= C+

2

E

q; ^tj

1 1 + (n 1 + (n 1 + (n

1)e 1 1)e 1 1)e

34

ti

(ti

+

1 1 + (n )

(ti + )

+ +

1)e

1 + (n 1 + (n

tj

1 1)e 1 1)e

;

(tj + )

(tj

)

; :

Since t is optimal, we must have that E [ (q; tj )] > E [ (q; t0 j )] : This is is equivalent to

h e

and, since ti > tj , for small 2

1)2 e

(n

(ti +tj )

tj

(ti

e

)

ih

2

tj

we have that e

1)2 e

(n

(ti

e

)

i 1 > 0;

(ti +tj )

> 0 and therefore optimality of t requires that

1 > 0.

Similarly, since t is optimal, we must have that q; ^tj

E [ (q; tj )] > E This is equivalent to

h

(tj

e

(tj

and, since ti > tj , we have that e 1)2 e

(ti +tj )

)

ti

e )

ih ti

e

2

1)2 e

(n

:

i 1 > 0;

(ti +tj )

> 0, and therefore optimality of t requires that

2

(n

1 < 0. We have then reached a contradiction. This completes the proof of Lemma 7.

Proof of Proposition 4. Expected payo¤s of an `-leader organization can be rewritten as E( (q; tj )) =

n

2

2

+`

1 + (n

2

1)(1

+ (n

r( =`))

`)

1 + (n

1)

:

Hence, we obtain that dE [ (q; tj )] = d` where

h (`; ; ; n) = ` 1

and that

2

(1 + (n

e

`

ih 1 + (n 2

d2 E [ (q; tj )] = d`d`

1))` 1 + (n

(n

dE[ (q;tj )] d`

Denote by ~ the solution to 1 Since 1

(n

1)e

`

`

1)e

`

1)e

is decreasing in

n

i 3

( (n h

1

(`; ; ; n);

`

(n

1) + 1)e

1)e

(`; ; ; n) is decreasing in

is the same as the sign of

~ (n

`

1) 2 e

`3 1 + (n

Observation 1. By direct veri…cation, the function that the sign of

1)e

1)e

`

i

`

;

:

for all `; ; n. Note also

(`; ; ; n).

= 0. Also, denote by ^ the solution to 1

^ (n

1)e

and decreasing in `, the following observation follows:

35

= 0.

Observation 2. (2a) ~ < ^ for all ; n; (2b) If d2 E[ (q;tj )] d`d`

d2 E[ (q;tj )] d`d`

< ~ then

< 0 for all `; (2c) If

> ^ then

> 0 for all `.

We now show that there exists a

( ; n) > 0 such that for all

<

optimal organization is ` = n. Denote by ( ; n) the solution to n 1

e

(n; ( ; n); x; n) = 0. Explicitly, e

n

n

~:

( ; n) = n 1

e

( ; n) the number of leaders in the

n

Observation 3. Direct veri…cation implies (3a) ( ; n) < ~ for all ; n; (3b) ( ; n) is increasing in . Observation 3a together with observation 2b imply that for all

< ( ; n), the lower value of dE [ (q; tj )] j`=n = d`

because, by observation 1, Hence, for all

dE[ (q;tj )] d`

dE[ (q;tj )] d`

is declining in ` for all

is obtained when ` = n, and, at ` = n we have

2

(1 + (n

(n; ; ; n) >

< ( ; n). So,

1))n 1 + (n

1)e

(n; ; ; n) > 0;

n

(n; ( ; n); ; n), and, by de…nition,

(n; ( ; n); ; n) = 0.

< ( ; n) the expected returns of an `-leader organization are increasing in the number of

leaders, which implies that the optimal organization has ` = n leaders. Next, observation 3b together with the observation that lim

!0

( ; n) = 1, imply that for all

< 1, the

optimal organization has ` = n leaders, regardless of the level of . We now show that there exists a ( ; n) > ( ; n) such that for all the number of leaders is ` = 1. Denote by ( ; n) the solution to ( ; n) =

1

e

e 1+e

> ( ; n) in the optimal organization (1; ( ; n); ; n) = 0. Explicitly ^:

Observation 4. Direct veri…cation shows: 4a. ~ < ( ; n) < ^ , for all

and n; 4b.

( ; n) is increasing in

(1; ; ; n) < 0 for all

> ( ; n). Similarly,

.

Observation 1 together with

(1; ( ; n); ; n) = 0 imply that

observation 1 together with

(n; ( ; n); ; n) = 0 and observation 4a, imply that

> ( ; n). So,

dE[ (q;tj )] d`

(n; ; ; n) < 0 for all

is negative at ` = 1 and at ` = n. Observation 4a and observation 2b implies

36

that

dE[ (q;tj )] d`

is either …rst decreasing in ` and then increasing in ` (when

2 [ ( ; n); ^ ]) or it is always

> ^ ]). Hence, the pro…ts of the organization are decreasing in ` for all

increasing in ` (when

> ( ) and

therefore the optimal organization has ` = 1 leader. We now conclude by considering the case where

2 ( ( ; n); ( ; n)). From the analysis above we infer that

the marginal expected pro…ts to ` of the organization around ` = 1 are positive, because (1; ; ; n) > 0, and that the marginal expected pro…ts of the organization around ` = n are negative, because Furthermore, observation 2b implies that, for all organization,

dE[ (q;tj )] , d`

dE[ (q;tj )] j`=` d`

2 ( ( ; n); ( ; n)), the marginal expected pro…ts of the

are either always decreasing in ` (when

in ` and then increasing in ` (when

(n; ; ; n) < 0.

2 [ ( ; n); ~ ]) or they are …rst decreasing

2 [ ~ ; ( ; n)]). Hence, there exists a unique ` 2 [1; n] such that

= 0; such value of ` is the solution to

(` ; ; x; n) = 0 and, ` maximizes the expected

pro…t of the organization. Finally, by applying the implicit function theorem, d` =d

< 0 if and only if

d (` ; ; ; n)=d` < 0. Note that this last inequality holds because the fact that there exists a unique ` in which

(` ; ; ; n) = 0 and the fact that

2 ( ( ; n); ( ; n)) the function

(1; ; ; n) > 0 and

(n; ; ; n) < 0, assure that for all

(`; ; ; n) is decreasing around ` .

We have therefore shown that for every ` 2 f1; :::; n

1g there exists a

= (`+1) the optimal organization has ` = `+1 leaders; b. if has either ` = ` leaders or ` = ` + 1 leaders, and c. if

(` + 1) <

(`) such that: a. if

2 ( (`+1); (`)) the optimal organization

= (`) the optimal organization has ` = ` leaders.

We now show that the optimal number of leaders ` is increasing in , which, in view of the above analysis, amounts in showing that, for every ` 2 f1; :::; n `,

such that at

=

`

1g there exists a unique value of

2 ( (` + 1); (`)), say

the expected pro…t of the `-leader organization is the same as the expected pro…t of

the ` + 1-leader organization. This is what we show next. For brevity de…ne G(x) = e

x

and denote by

(`; ) the di¤erence between the expected pro…t generated

by the ` + 1-leader organization and the expected pro…t generated by the `-leader organization. We obtain (`; ) =

2

1 + (n

`+1 1)G(` + 1)

` 1 + (n

Taking the minimum common denominator, we have that (1 + (n

1)) [(` + 1)(1 + (n

1)G(`))

[1 + (n

37

1)G(`)

1 1 + (n

1)

:

(`; ) = 0 if, and only if, `(1 + (n

1)G(` + 1))]

1)G(`)][1 + (n

1)G(` + 1)]

=

0:

This is a quadratic equation in to check that

and therefore there are only two solutions of . Moreover, it is immediate

= 0 is one of the solution. Hence, there is only one non-zero solution. We have therefore

completed the proof of the …rst part of proposition 4. To complete the proof of the proposition, we show that, for every ` 2 f1; :::; n increasing in . De…ne t =

; then the cut o¤

(1+ (n 1)) (` + 1)(1 + (n

1)e

t `

)

1g, the cut o¤

t `+1

1)e

)

1 + (n

1)e

t `+1

1 + (n

which, after some algebra, is

`+1

=

"

n

t `(`+1)

t

e `+1 + `e t `

1 `+e

(1 + `) t `(`+1):

(1 + `)e

#

Note that nominator is increasing in t because d `e

t `(`+1)

t

+ e `+1 =

dt

t 1 e `+1 `+1

t `2 +`

e

< 0;

whereas the denominator is decreasing in t because t `

d e

t `(`+1)

(1 + `)e

1 e `

=

dt

t `

t `2 +`

e

< 0:

It follows that d

`+1

> 0:

d Note further that lim

= lim

`+1

!1

!1

t 1 `+1 e = +1 `

This concludes the proof of Proposition 4. Proof of Proposition 5: Recall that in an organization of size n and ` leaders the pro…ts are E [ (n; `)] = nP

F

n

2

2

2

+`

1 + (n

=`

1) e

+ (n

`)

1 + (n

1)

+ (n + 1

`)

:

Similarly, in an organization of size n + 1 and ` leaders the pro…ts are E [ (n + 1; `)]

= =

(n + 1)P (n + 1)P

F F

(n + 1) (n + 1)

2

2

2

+`

2 =`

1+n e 2

+

1 + (n

1) ^

2

+`

1 + (n

2

1) ^ e

=`

+ (n

where ^=

is

is the (non-zero) solution of

`+1

`(1 + (n

1

`+1

n (n

38

1)

`)

1 + (n

1) ^

1+n

1)e

t `

= 0;

Given the above derivation we have that 2

2

`n ( ) = arg max

`

l=1:::n

and `n+1 ( ^ ) = arg max

l=1:::n+1

Suppose …rst that `n+1 ( ^ )

1 + (n "

+ (n

=`

1) e

`)

1 + (n

1) 2

2

`

1 + (n

1) ^ e

=`

+ (n

`)

1) ^

1 + (n

n. It the follows from Proposition 4 that since ^ >

#

then `n+1 ( ^ )

`n ( );

which concludes the proof of the …rst part of the proposition. Suppose now that `n+1 ( ^ ) = n + 1; From Proposition 4 we know that: A. `n+1 ( ^ ) = n + 1 if, and only if, ^

( ; n + 1) (where ( ; n + 1) is derived in Proposition 4) and B.

in ` = 1:::n + 1 for all

0

( ; n + 1). Since, as the derivation above show, 0

evaluated at the same value of evaluated at ^

dE[ (n+1;`)] d`

, we have that

dE[ (n;`)] d`

is positive and decreasing

dE[ (n+1;`)] d`

=

dE[ (n;`)] d`

when

is positive and decreasing for all ` = 1:::n when

( ; n + 1). But then, since B holds and since

; it follows that

is decreasing in

2

. The second part of the proposition follows from this result

and proposition 5.

39

For Online Publication. Appendix B: Technological trade-o¤s between adaptation and coordination.

We show that our insights hold in a model of coordination a la Alonso, Dessein, Matouschek (2008), Rantakari (2008) and Calvo-Armengol et al (2011). We consider the case for two agents, but everything can be generalized to n agents. In these class of models, instead of having the distinction between primary action and complementary action, each agent chooses one single action. We posit that agent i chooses qi . Given a particular realization of the string of local information, = [ 1;

2 ],

and a choice of actions, q = [q1 ; q2 ], the realized pro…t of the organization is: (qj ) = K

where

2 1)

(q1

(q2

2 2)

(q1

q2 )2 ;

(19)

is some positive constant. As in the model developed in our paper, agent i has information

set Ii that contains the local shock

i

and a message mj about local shock

j.

The communication

technology follows the description in our basic model. Standard computation allows us to derive agents’ best replies, for a given network t = (t;

t).

We obtain: 1 1+ 1 1+

q1 = q2 =

[

1

+ E [q2 jI1 ]]

(20)

[

2

+ E [q1 jI2 ]]

(21)

Without loss of generality we focus on characterizing equilibria in linear strategies; we write (20) and (21) as q1 = a11 (t1 )

1

+ a12 (t2 )E[ 2 jI1 ]

(22)

q2 = a22 (t2 )

1

+ a21 (t1 )E[ 1 jI2 ]

(23)

Substituting the guess (22) and (23) into (20) and (21), we …nd that the equilibrium actions are q1 = q2 =

1+ 1+2 + 1+ 1+2 +

2

e

t1 1

+

2

e

t2 2

+

1+2 +

2

e

t2

E[ 2 jI1 ]

(24)

1+2 +

2

e

t1

E[ 1 jI2 ]

(25)

Finally substituting (24) and (25) into (19) and taking unconditional expectations we …nd that the problem max E (qj ) s:t:t1 + t2 = t

40

is equivalent to 1 ]1+2 +

max

t2[0;

where t = t1 and t2 =

2

e

t

+

1 2

1+2 +

e

(

t)

t.

Similarly to the proof of proposition 1 and proposition 2, it is easy to verify that @E (qj ) > 0 () (1 + 2 )2 @t

4

e

> 0:

We then obtain a result that is qualitatively the same as the one stated in Proposition 3. For every there exists a ( ) > 0, so that for all for every

< ( ) the optimal organization has t = =2, whereas

> ( ) the optimal organization has t = f0; g. Furthermore, ( ) is increasing in .

For Online Publication: Appendix C: Task Asymmetries. We have shown that organizational focus may be optimal even when tasks (and agents) are ex-ante identical. In reality, tasks are of course likely to di¤er from each other. The question, then, is not only how focussed to be, but which tasks to focus on. An interesting asymmetry is one where some tasks impose larger coordination costs (delays, low product quality) should other tasks not take the appropriate coordinating actions. For example, in designing a car, important changes made to how the engine works, may have important consequences for the remainder of the design. Should attention be focused on those highly interdependent tasks? In this section we show that this is not necessarily the case. For conciseness of the argument, we consider the two-task case. p Let the coordination parameters be 1 and 2 for task 1 and 2, respectively.De…ne = 1 2,

the geometric mean of

1

and 1

The parameter increase in

=

2

and consider situations where (1 + )

and

2

=

(1 + )

1

:

thus determines the “spread” between the coordination costs across tasks: An

> 0 increases the coordination costs associated with task 1 and decreases that of task

2, leaving the geometric average, a su¢ cient statistic for how costly lack of coordination is to the organization, unchanged. When

Proposition 8 Assume 1. If

1

>

= 0 the case collapses to the one considered in Section 2. 2

1, then:

< > ( ), the optimal organization is focused on task 2, i.e., (t1 ; t2 ) = (0; ).

41

2. If

> ( ), let b be the solution to (1 + b)2 e

(a) If < b then (b) If

2

=1:

> t1 > t2 > 0:

b, then (t1 ; t2 ) = ( ; 0) :

If attention is limited,

< > ( ), then all attention is focused on the task which is least

interdependent: Task 2. The reason is that allocating limited attention to task 1 is essentially not worth it as it would translate into limited adaptation given the large coordination costs the organization would bear. Instead, it is better to provide all attention to task 2 and let task 2 be adaptive. Task 1 is then coordinated by restricting its adaptiveness. Instead when the attention capacity is larger and the asymmetry

is not too large, both tasks

receive attention but task 1 receives more than task 2. Intuitively, if both tasks are allowed to be adaptive, more attention needs to be devoted to that task that is more interdependent. If asymmetries between both tasks are su¢ ciently large, task 2 may even receive no attention for > > ( ) : At the threshold ^ = > ( ) ; the organization then switches from being fully focussed on task 2 to being fully focussed on task 1: Proof of Proposition 8. Replicating the analysis for the model with two-tasks, by allowing for asymmetries, we obtain that equilibrium actions are qii = aii

i

and qji = aii E[ j jIi ] and aii =

1 ie

1+

t

:

Furthermore, we can express expected pro…t for a given t as E[ (qj ] =

(1

a11 )2

2

(1

a22 )2

2

1 (1

r1 )a211

2

2 (1

r2 )a222

2

;

(26)

Hence, the organizational problem is to choose t1 = t 2 [0; ] to maximize expression (26). Repeating the arguments developed for the symmetric case, we obtain that the pro…ts of the organization are decreasing in t, if, and only if, [1

1 2e

][

1e

t

2e

(

t)

] > 0:

It is convenient to divide the analysis in two cases. Recall that we are assuming that (which is equivalent of assuming

2

> ^ = 1).

42

(27) > 1+

Case 1. Assume that

1e

2

> 0, or > b. This assumption implies that

1e

t

2e

(

t)

>

0 for all t 2 [0; ]. This in turn implies that the objective function is decreasing in t if, and only if, 1 which is always satis…ed because

1 2e

< 0 ()

> 1 + . So, if

< ln

< ln

> b, it is optimal to set t = 0 and

and

there is focus on task 2. Case 2. Assume now that 2e

(

t)

1e

2

< b. Since

< 0, or

declines in t, it follows that there exists a t so that

such t solves

1= 2

=e

t ) =e

(

t

and since

1

>

2

and e

2e

1

t

1e t

> 0 and since 2e

(

t )

1e

t

= 0. Indeed,

is decreasing in t, it follows that

t > =2. The next two observations complete the proof: First, if 1

1 2e

> 0, then the objective function is increasing in t for t

in t for all t > t . Hence, in the optimal organization t = t . Second, if 1 objective function is decreasing in t for all t

t and it is decreasing 1 2e

t and increasing in t for all t

< 0, then the t . Hence, there

are two candidates for the minimum: either t = 0 or t = . Comparing the two organizations it reveals that since 1 task 2. Note also that 1 if,

1 2e

< 0 the optimal organization has t = 0, and so there is focus on

1 2e

> 0 and

1e

2

< 0, are mutually compatible, if and only

> 1 + , which holds by assumption. This concludes the proof of Proposition 8.

For Online Publication. Appendix D: Information Theory. We now show that an identical residual variance as in (5) obtains if messages mji and local information

i

are normally distributed and the attention constraint (3) is modelled as a constraint

on the total reduction in entropy, as in Information Theory (Cover and Thomas 1991) and the literature on Rational Inattention (Sims 2003). Since, in an equilibrium with linear strategies, the expected pro…ts for a given attention allocation t can be written as a function of the residual variance, identical results obtain with this alternative communication technology.32 32

It is easy to show that, for a given t, the expected pro…ts in an equilibrium with linear strategies can be

written as: E[ (tj )]

n

2

+

n X i=1

Details are available upon request to the authors.

43

2 2

+ (n

1)RV(ti )

:

For simplicity, we focus on the two-task case. Let mi be a message about

i

and let m =

(m1 ; m2 ). The mutual information between m and ; denoted by I( ; m); equals the average amount by which the observation of m reduces uncertainty about the state ; where the ex ante uncertainty is measured by the (di¤erential) entropy of , Z H( ) = f ( ) log f ( )d ; and the uncertainty after observing m is measured by the corresponding entropy Z H( jm) = f ( jm) log f ( jm)d : Denoting by

the (Shannon) capacity of the communication channel, the constraint on information

conveyed by m about

is given by

33

I( ; m) = H( )

H( jm)

:

(28)

Following Sims (2003) and the subsequent literature on rational inattention, we assume that and

2

1

are (independently) normally distributed, and communicated through a Gaussian commu-

nication channel which contaminates its inputs with independent normally distributed noise, e.g., mi =

i

+ i , where

i

is normally distributed. As a result, also m1 and m2 and the conditional

distributions F ( 1 jm1 ) and F ( 2 jm2 ) are independently normally distributed. As noted by Sims, Gaussian communication channels minimize the variance of F ( i jmi ) given the constraint (28) on the mutual information between i:

34

Given that

1

and

i

and mi : Hence, they maximize the correlation between mi with

are independently distributed, we have

2

I( ; m) = I( 1 ; m1 ) + I( 2 ; m2 ); where I( i ; mi ) = H( i ) 2

is given by

1 2

ln(2 e

H( i jmi ): Moreover, since the entropy of a normal variable with variance 2 );

we obtain I( i ; mi ) =

33

(29)

1 ln 2

2

ln Var( i jmi ) :

(30)

The capacity of a channel is a measure of the maximum data rate that can be reliably transmitted over

the channel. Shannon capacity has proven to be an appropriate concept for studying information ‡ows in a variety of disciplines: probability theory, communication theory, computer science, mathematics, statistics, as well as in both portfolio theory and macroeconomics. 34 This follows from a well known result in information theory that among all distributions with the same level of entropy, the normal distribution minimizes the variance.

44

It follows that the constraint (28) on the mutual information between ln

2

2

ln Var( 1 jm1 ) + ln

ln Var( 2 jm2 )

We can now re-interpret the mutual information between mi and the organization to task i: Denoting t1

and m can be rewritten as

I( 1 ; m1 ) and t2

2 :

i

(31)

as the attention devoted by

I( 2 ; m2 ); the constraint on mutual

information (28) imposed by the Shannon capacity becomes equivalent to our attention constraint t1 + t2

:

Using the above formalization, we obtain a tractable expression for RV(ti ) deed, from (30) and ti

V ar( i jmi ). In-

I( i ; mi ), we have 2

ln RV (ti ) = ln

2ti ;

i = 1; 2:

(32)

or still RV (ti ) = where t1 + t2

2

e

2ti

;

i = 1; 2;

(33)

:

For Online Publication. Appendix E: Alternative communication models. E.1. Bilateral communication with aggregate organizational constraints. We now consider that communication is bilateral and that the constraint is at the organizational level. Formally, the allocation of attention is t = ftji gji2N , where tji denotes the amount of communication between agent i and agent j about local information

i.

Let

be the total communication

capacity of the organization. Then, we require that t satis…es XX i

j

We maintain the assumption that r(tij ) = 1

e

tij tij .

: The following equivalent result obtains:

Result 1. In an optimal organization under bilateral communication and constraint allocation of attention t = ftji g satis…es tji = tPi for all i; j 2 N ;

45

, the

where tP = ftP1 ; :::; tPn g is the allocation of attention in an optimal organization under public communication and constraint

P

= =(n

1).

Proof of Result 1. The key step for this equivalence result is the proof of the following Lemma

Lemma 9 Consider bilateral communication and constraint . In an optimal organization all agents devote the same attention to a particular agent, that is, for all i 2 N , tji = tki for all j; k 2 N n fig. Proof of Lemma 9. Suppose that t is optimal and, for a contradiction, assume that there exists 0. De…ne a new organization t0 , which is the same as t with the

some agent i such that tji > tki exception that t0ji = tji

and t0ki = tki + , for some small and positive . Using the expression

for expected payo¤s, it is easy to verify that E [ (q; tj )]

q; t0 j

E

0;

if, and only if, e Since t0ji = tji

+e

t0ki

e

tji

+e

tki

:

(34)

and t0ki = tki + , after some algebra we obtain that condition 34 is equivalent to e

which, for

t0ji

tki

e

(tji

)

() tki

tji

;

su¢ ciently small, contradicts our initial hypothesis that tji > tki . This completes the

proof of Lemma 9. Note that under bilateral communication and arbitrary capacity , Lemma 9 implies that the optimal allocation of attention t satis…es tji = tli for all j; l 6= i. Hence, in the optimal organization every agent j 6= i devotes the same attention to agent i, that is the restriction imposed by public communication. It is immediate to see the relation between

and

P.

E.2. Individual Communication Constraints. So far we have assumed that the communication constraint is determined at the organizational level. Alternatively, each agent may have a limited communication capacity

I:

Formally, let each

agent have access to an individual communication channel, whose …nite capacity

46

I

can be used

to broadcast information to all other agents and/or to process information broadcasted by others. Each agent i then optimally decides on a vector ti = [ti1 ; ti2 ; :::; tii ; :::; tin ] ; where X

tij

I

8i 2 N ;

j2N

(35)

and where tii is the capacity devoted to broadcast information about

i,

and tij is the capacity

devoted to listen to the information broadcasted by agent j 6= i: We maintain the assumption that 1

r(tij ; tjj ) = e

maxftij ;tii g

We now proof the following equivalence result, which again implies

that the optimal organization is an `

leader organization with ` 2 f1; 2;

; ng leaders and that

the same comparative statics hold as in Proposition 4. Result 2. Under individual communication and individual capacity constraint

I,

in the optimal

organization the allocation of attention t = fti jgi;j satis…es tjj = tij = tbij 8i; j 2 N where tb = ftbij gi6=j is the allocation of attention in the optimal organization under bilateral communication and capacity constraint

= (n

1) I .

Proof of Result 2. Consider the case of individual communication with individual capacity constraint a. tji

I.

Suppose that t is an optimal organization. It is immediate to see that t satis…es: P I for all ij 2 N . Now note that if tb is an optimal tii for all i; j 2 N and b. j tji =

organization under bilateral communication and constraint

= (n

1) I , then organization t

with tji = tii = tbji is a feasible organization under individual communication and satis…es property a. and b. above. We now claim that t is optimal under individual communication and individual capacity constraint

I.

Suppose there is another organization t that does strictly better than t .

First, note that the expected pro…t of an organization, for a given t, can be written in terms of residual variances as follows E[ (tj )] =

n

2

+

n X i=1

where RV(tji ; tii ) =

2 (1

2 2

+ (n

1)

Pn

j=1 RV(tji ; tii )

;

r(tji ; tii )). Second, t must satisfy property a and property b and therefore

minftji ; tii g = tji , and so the residual variance that agent j has about task i is RV(tji ). Since t is strictly better than t is follows that the pro…le of residual variances fRV(tji )gji is better than fRV(tji )gji . But then, construct ^ tb as follows: t^bji = tji . Note that ^ tb is feasible under bilateral

47

communication and capacity . Furthermore since the pro…le of residual variances fRV(tji )gji is better than fRV(tji )gji , it must also be true that pro…le of residual variances fRV(t^bji )gji is better than fRV(tbji )gji , and so ^ tb must be strictly better than tb , which contradicts our initial hypothesis that tb is optimal.

For Online Publication. Appendix F: Endogenous Attention Capacity. So far we have taken

to be a hard constraint in the amount of time agents can devote to com-

munication with each other. In practice this is another margin that organizations can use to improve performance, by, for example, allowing more time for meetings and communication between teams. Equivalently, the organization can increase the e¤ective communication capacity ; by cross-training and rotating employees, by hiring employee with higher cognitive abilities, or by investing in communication technology. Assume thus that an organization can acquire a capacity at a cost C ( ). C ( ) represents for example the costs of having team members engaged in communications activities rather than in production. We assume that this cost has the following properties: C (0) = C 0 (0) = 0

C0 ( ) > 0

C 00 ( )

0 and C 000 ( )

0:

The problem of organizational design is now max E (qj ) ;t

Proposition 10 Assume that

> b , then

1. The optimal communication capacity 2. There exists

2

>

2

C( )

subject to

g if

2

(36)

2.

is increasing in

> 0 such that t1 2 f0;

(3) :

2

and t1 =

2

if

2

>

2.

Proof of Proposition 10. We prove each of the two parts of the proposition. First part. We …rst show that the optimal capacity

is increasing in

2

in the focused organi-

zation and in the balanced organization. This, together with Proposition 2, implies the …rst part of Proposition 10: the optimal capacity

is increasing in

48

2.

We consider the focused organization …rst. Recall that the expected pro…ts in the focused organization are c

E[

Taking the derivative with respect to

)]

=

@

Since C 00 ( ) and, as

@E[

c (qj

@

)]

j

[1 + e

< 0. Moreover " 2 2e @ 2 E [ c (qj )] = @ @ [1 + e

0, C 000 ( )

increases, 1

negative for all

0 and 1 e

e

c (qj

@E[

)]

@

)]

c (qj

)]

2

is increasing in

=

c

of the optimality condition of ! 0 and that

h

3

]

1

c

j

=0

i

e

> 0, and that, since C 0 ( ) > 0, #

+ C 00 ( ) : (recall that @ 2 E[ c (qj )] @ @

> ^ = 1) is either

and negative otherwise. Summarizing,

is (i) positive at

c

= 0, (ii) negative at

= 1 and (iii)

! 1 as

2

uniquely solves c

2e

[1 + e

c

]

C 0 ( c ) = 0:

2

and since, from above,

the implicit function theorem implies that

2

)]

@

or it is …rst increasing and then decreasing in . As a consequence of

@ c (qj

@E[

becomes eventually positive, it follows that

@E [

@

c (qj

]

is negative for small value of

(i)-(iii) we obtain that the optimal capacity

@E[

C 0 ( ):

=1

we have shown that the function

Since

2

> 0, or it is positive for small value of

it is either decreasing in

C( ):

2e

We now observe that, since C 0 (0) = 0, it follows that it follows that

e 1+ e

+

we have

c (qj

@E [

1 1+

2

(qj )] =

c

@ 2 E[ c (qj )] j = @ @

is an increasing function of

c

< 0, an application of 2.

From investigation

and the assumptions that C 0 (0) = 0, it follows that

c

! 0 as

! 1.

We now consider the case in which the organization is balanced. The expected pro…ts in the balanced organization are E

h

d

i (qj ) =

Taking the derivative with respect to @E

d (qj

@

2

2e

2

1+ e we obtain )

2e =h 1+ e

49

C( ):

2

2

2

i2

C 0 ( ):

We can now proceed in the same fashion as in the case for the balanced organization to conclude d

that the optimal capacity

uniquely solves d (qj

@E

d

2

=h 1+ e

@ and that

d

2e

)

d

2,

is an increasing function of

C 0 ( d ) = 0;

i2

d 2

2

! 0 as

d

! 0 and

2

! 1 as

! 1.

Since the optimal capacity in the focused and balanced organization are both increasing in

2

and

since, by Proposition 2, the optimal organization is either focused or balanced, it follows that the 2.

optimal capacity of the optimal organization is increasing in

Second part. We now prove the second part of the proposition. First note that for a given common @E [

c (q;

@

d (q;

@E

j )]

@

j )

> 0;

if, and only if, e [1 + e

e ]2

2

[1 + e

2

]2

> 0;

and, after plain algebra, this condition is equivalent to h Since 2

3

e

c( 2)

e

e

2

is increasing in

ih 1 2

c (^ 2 ) 2

2

e

3 2

i

>0

()

1

2

e

3 2

< 0:

ranging from 0 to 1, there exists a unique ^ 2 that solves 1 2

= 0. By construction, if

= ^ 2 , then

c (^ 2 )

=

d (^ 2 ).

The next observation is used

in the rest of the proof. Observation 1.

d( 2)

<

c (^ 2 )

if, and only if,

2

< ^2. 3

c (^ 2 )

2 2 To see this note that since c is increasing in 2 , it follows that 1 e < 0 for all 2 < ^ 2 . @E [ d (qj )] Hence, j c ( 2 ) < 0, which implies that d ( 2 ) < c ( 2 ). Analogously, since is increasing @ @E [ d (qj )] c 2 2 in 2 , it follows that 1 e 3 ( ) > 0 for all 2 > ^ 2 . Hence, j c ( 2 ) > 0, which implies @ d( 2)

>

De…ne now

2

that

c ( 2 ).

as the solution to 1

2

e

d( 2)

= 0 and de…ne

50

2

be such that 1

2

e

c( 2)

= 0.

2

We now show that

> ^ 2 . By de…nition of ^ 2 and 1

which implies that

d( 2)

>

We now show that

2

2.

>

d( 2)

d( 2)

have that

c ( 2 ).

>

3

e

d (^ 2 ),

d (^ 2 )

=

2

c ( 2 ).

d

and since 2

e

d( 2)

Since

Hence, in order for

e

d( 2)

is increasing in 2

and

e

> ^ 2 and since d( 2)

=

2

e

c( 2)

2

2,

then 1

2

e

c( 2)

0 and 1

2 2

2

e

it follows that

2

> ^2.

c( 2)

;

d( 2)

>

c( 2)

2

for all

to hold we must have that

< 0. From Proposition 2 we know that for all

optimal organization is focused. Hence, if

2

c( 2)

We now complete the proof of the second part of Proposition 10. If and 1

;

we have that 2

=0=1

2

we have that 2

=0=1

2

By de…nition of 1

which implies that

2

2,

2

2,

then 1

such that 1

2

e

2

e

> ^ 2 , we 2

>

d( 2)

2.

0

0 the

the optimal organization is focused. Finally, if

d( 2)

> 0 and therefore, in view of Proposition 2,

it follows that the balanced organization is optimal. From Part 1 of the Proposition, it pays to invest more in communication capacity when the environment becomes more volatile. Intuitively, the cost of not being adapted is then larger and a better communication capacity allows for better adaptation. From Part 2, a focused organization is optimal in environments for which adaptation is not very important. Intuitively, a focused organizations is optimal when the communication capacity is limited, and the organization does not invest much in communication capacity when adaptation is not very important. Similarly, balanced organizations are optimal when adaptation to the environment is very important, and the organization invests heavily in communication capacity.

51

which adaptation is not very important. Intuitively, a focused organizations is optimal when the communication capacity is limited, and the organization does not invest much in communication capacity when adaptation is not very important. Similarly, balanced organizations are optimal when adaptation to the environment is very important, and the organization invests heavily in communication capacity.

52

Rational Inattention and Organizational Focus

Short Description

Description

Comments