skip to main content
10.1145/2000064.2000103acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
research-article

Power management of online data-intensive services

Published:04 June 2011Publication History

ABSTRACT

Much of the success of the Internet services model can be attributed to the popularity of a class of workloads that we call Online Data-Intensive (OLDI) services. These workloads perform significant computing over massive data sets per user request but, unlike their offline counterparts (such as MapReduce computations), they require responsiveness in the sub-second time scale at high request rates. Large search products, online advertising, and machine translation are examples of workloads in this class. Although the load in OLDI services can vary widely during the day, their energy consumption sees little variance due to the lack of energy proportionality of the underlying machinery. The scale and latency sensitivity of OLDI workloads also make them a challenging target for power management techniques.

We investigate what, if anything, can be done to make OLDI systems more energy-proportional. Specifically, we evaluate the applicability of active and idle low-power modes to reduce the power consumed by the primary server components (processor, memory, and disk), while maintaining tight response time constraints, particularly on 95th-percentile latency. Using Web search as a representative example of this workload class, we first characterize a production Web search workload at cluster-wide scale. We provide a fine-grain characterization and expose the opportunity for power savings using low-power modes of each primary server component. Second, we develop and validate a performance model to evaluate the impact of processor- and memory-based low-power modes on the search latency distribution and consider the benefit of current and foreseeable low-power modes. Our results highlight the challenges of power management for this class of workloads. In contrast to other server workloads, for which idle low-power modes have shown great promise, for OLDI workloads we find that energy-proportionality with acceptable query latency can only be achieved using coordinated, full-system active low-power modes.

Skip Supplemental Material Section

Supplemental Material

isca_7b_1.mp4

mp4

221.7 MB

References

  1. "AMD Family 10h Server and Workstation Processor Power and Thermal Data Sheet Rev 3.15" 2010.Google ScholarGoogle Scholar
  2. "Intel-Xeon Processor 5600 Series. Datasheet, Volume 1," 2010.Google ScholarGoogle Scholar
  3. L. A. Barroso and U. Hölzle, The Datacenter as a Computer. Morgan Claypool, 2009.Google ScholarGoogle Scholar
  4. L. A. Barroso, J. Dean, and U. Hölzle, "Web search for a planet: The google cluster architecture," IEEE Micro, vol. 23, no. 2, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. A. Barroso and U. Hölzle, "The case for energy-proportional computing," Computer, vol. 40, no. 12, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Blaauw, S. Das, and Y. Lee, "Managing variations through adaptive design techniques," Tutorial at International Solid-State Circuits Conference, 2010.Google ScholarGoogle Scholar
  7. E. V. Carrera, E. Pinheiro, and R. Bianchini, "Conserving disk energy in network servers," in Proceedings of the 17th annual international conference on Supercomputing, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. S. Chase, D. C. Anderson, P. N. Thakar, A. M. Vahdat, and R. P. Doyle, "Managing energy and server resources in hosting centers," in Symposium on Operating System Principles, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. V. Delaluz, M. Kandemir, N. Vijaykrishnan, A. Sivasubramaniam, and M. J. Irwin, "Hardware and software techniques for controlling DRAM power modes," IEEE Trans. Comput., vol. 50, no. 11, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Q. Deng, D. Meisner, T. F. Wenisch, and R. Bianchini, "MemScale: Active Low-Power Modes for Main Memory," in Architectural Support for Programming Languages and Operating Systems, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. Diniz, D. Guedes, W. Meira, Jr., and R. Bianchini, "Limiting the power consumption of main memory," in phInternational Symposium on Computer Architecture, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Elnozahy, M. Kistler, and R. Rajamony, "Energy conservation policies for web servers," in phProceedings of the 4th USENIX Symposium on Internet Technologies and Systems, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. X. Fan, C. S. Ellis, and A. R. Lebeck, "The synergy between power-aware memory systems and processor voltage scaling," in Workshop on Power-Aware Computing Systems, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. X. Fan, W.-D. Weber, and L. A. Barroso, "Power provisioning for a warehouse-sized computer," in International Symposium on Computer Architecture, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Gurumurthi, A. Sivasubramaniam, M. Kandemir, and H. Franke, "DRPM: dynamic speed control for power management in server class disks," in International Symposium on Computer Architecture, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Heath, B. Diniz, E. V. Carrera, W. Meira, Jr., and R. Bianchini, "Energy conservation in heterogeneous server clusters," in Principles and Practice of Parallel Programming, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Janzen, "Calculating memory system power for DDR SDRAM," Micron DesignLine, vol. 10, no. 2, 2001.Google ScholarGoogle Scholar
  18. S. Kaxiras and M. Martonosi, Computer Architecture Techniques for Power-Efficiency. Morgan Claypool, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, and S. Reinhardt, "Understanding and designing new server architectures for emerging warehouse-computing environments," in International Symposium on Computer Architecture, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Meisner, B. T. Gold, and T. F. Wenisch, "PowerNap: Eliminating server idle power," in Architectural Support for Programming Languages and Operating Systems, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Meisner and T. F. Wenisch, "Stochastic Queuing Simulation for Data Center Workloads," in Exascale Evaluation and Research Techniques Workshop, 2010.Google ScholarGoogle Scholar
  22. E. Pinheiro and R. Bianchini, "Energy conservation techniques for disk array-based servers," in International Conference on Supercomputing, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. K. Rajamani, C. Lefurgy, S. Ghiasi, J. Rubio, H. Hanson, and T. W. Keller, "Power management solutions for computer systems and datacenters," in International Symposium on Low-Power Electronics and Design, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. E. Schurman and J. Brutlag, "The user and business impact of server delays, additional bytes, and HTTP chunking in web search," Velocity, 2009.Google ScholarGoogle Scholar
  25. D. Snowdon, S. Ruocco, and G. Heiser, "Power Management and Dynamic Voltage Scaling: Myths and Facts," in Workshop on Power Aware Real-time Computing, 2005.Google ScholarGoogle Scholar
  26. S. Srinivasan, L. Zhao, B. Ganesh, B. Jacob, M. Espig, and R. Iyer, "Cmp memory modeling: How much does accuracy matter?" in Workshop on Modeling, Benchmarking and Simulation, 2009.Google ScholarGoogle Scholar
  27. N. Tolia, Z. Wang, M. Marwah, C. Bash, P. Ranganathan, and X. Zhu, "Delivering energy proportionality with non energy-proportional systems -- optimizing the ensemble," in HotPower, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. VJ Reddi, Benjamin Lee, Trishul Chilimbi, and Kushagra Vaid, "Web Search Using Mobile Cores: Quantifying and Mitigating the Price of Efficiency," in International Symposium on Computer Architecture, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Willis Lang and Jignesh M. Patel and Srinath Shankar, "Wimpy Node Clusters: What About Non-Wimpy Workloads?" in Workshop on Data Management on New Hardware, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. F. Xie, M. Martonosi, and S. Malik, "Intraprogram dynamic voltage scaling: Bounding opportunities with analytic modeling," ACM Trans. Archit. Code Optim, vol. 1, p. 2004, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Power management of online data-intensive services

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ISCA '11: Proceedings of the 38th annual international symposium on Computer architecture
          June 2011
          488 pages
          ISBN:9781450304726
          DOI:10.1145/2000064
          • cover image ACM SIGARCH Computer Architecture News
            ACM SIGARCH Computer Architecture News  Volume 39, Issue 3
            ISCA '11
            June 2011
            462 pages
            ISSN:0163-5964
            DOI:10.1145/2024723
            Issue’s Table of Contents

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 June 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate543of3,203submissions,17%

          Upcoming Conference

          ISCA '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader