PerfIso:Performance Isolation for Commercial Latency-Sensitive Services

Cited 68 time in webofscience Cited 0 time in scopus
  • Hit : 133
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorIorgulescu, Calinko
dc.contributor.authorAzimi, Rezako
dc.contributor.authorKwon, Youngjinko
dc.contributor.authorElnikety, Samehko
dc.contributor.authorSyamala, Manojko
dc.contributor.authorNarasayya, Vivekko
dc.contributor.authorHerodotou, Herodotosko
dc.contributor.authorTomita, Pauloko
dc.contributor.authorChen, Alexko
dc.contributor.authorZhang, Jackko
dc.contributor.authorWang, Junhuako
dc.date.accessioned2020-11-24T10:30:23Z-
dc.date.available2020-11-24T10:30:23Z-
dc.date.created2020-11-20-
dc.date.issued2018-07-12-
dc.identifier.citation2018 USENIX Annual Technical Conference, USENIX ATC 2018, pp.519 - 531-
dc.identifier.urihttp://hdl.handle.net/10203/277570-
dc.description.abstractLarge commercial latency-sensitive services, such as web search, run on dedicated clusters provisioned for peak load to ensure responsiveness and tolerate data center outages. As a result, the average load is far lower than the peak load used for provisioning, leading to resource under-utilization. The idle resources can be used to run batch jobs, completing useful work and reducing overall data center provisioning costs. However, this is challenging in practice due to the complexity and stringent tail-latency requirements of latency-sensitive services. Left unmanaged, the competition for machine resources can lead to severe response-time degradation and unmet service-level objectives (SLOs). This work describes PerfIso, a performance isolation framework which has been used for nearly three years in Microsoft Bing, a major search engine, to colocate batch jobs with production latency-sensitive services on over 90,000 servers. We discuss the design and implementation of PerfIso, and conduct an experimental evaluation in a production environment. We show that colocating CPU-intensive jobs with latency-sensitive services increases average CPU utilization from 21% to 66% for off-peak load without impacting tail latency.-
dc.languageEnglish-
dc.publisherUSENIX Association-
dc.titlePerfIso:Performance Isolation for Commercial Latency-Sensitive Services-
dc.typeConference-
dc.identifier.wosid000508006700040-
dc.identifier.scopusid2-s2.0-85077443353-
dc.type.rimsCONF-
dc.citation.beginningpage519-
dc.citation.endingpage531-
dc.citation.publicationname2018 USENIX Annual Technical Conference, USENIX ATC 2018-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationBoston, MA-
dc.contributor.localauthorKwon, Youngjin-
dc.contributor.nonIdAuthorIorgulescu, Calin-
dc.contributor.nonIdAuthorAzimi, Reza-
dc.contributor.nonIdAuthorElnikety, Sameh-
dc.contributor.nonIdAuthorSyamala, Manoj-
dc.contributor.nonIdAuthorNarasayya, Vivek-
dc.contributor.nonIdAuthorHerodotou, Herodotos-
dc.contributor.nonIdAuthorTomita, Paulo-
dc.contributor.nonIdAuthorChen, Alex-
dc.contributor.nonIdAuthorZhang, Jack-
dc.contributor.nonIdAuthorWang, Junhua-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 68 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0