CURRENT_MEETING_REPORT_ Reported by Guy Almes/Advanced Network and Services Minutes of the Benchmarking Methodology Working Group (BMWG) Introduction At the Danvers IETF, there was a BOF held to consider the need for a set of IP Provider Metrics (IPPM). Between the Danvers and Stockholm IETFs, it was agreed (at least provisionally) to structure any IPPM effort within the BMWG; Guy Almes would organize this effort as a new co-chair of the BMWG. Accordingly, the BMWG met at Stockholm for the sole purpose of initiating a new effort to produce IP Provider Metrics. Guy chaired the session. The chair used a set of slides to organize the session. These slides are reproduced following these minutes. The written minutes will emphasize those points not in the slides or where there was significant discussion on points that are present in the slides. In the minutes, the notation [slide k] will be used to refer to the kth slide. Internet Growth As context, it was noted that the Internet was growing in complexity along a number of dimensions [slide 2]. Further, while the Internet was once a consciously cooperative technical effort in which the operators of Internet components would (comparatively) easily share information, it has become a consciously competitive market in which operators of Internet components are (comparatively) reluctant to share information about the performance and reliability of their networks. General Criteria for IPPM General criteria for IP Provider Metrics were then discussed [slide 4]. It was stressed several times that we would need to be very careful to avoid biased metrics, and that the metrics and methodologies that result from our work should benefit both users and providers. Scott Bradner noted that, based on earlier BMWG experience, vendors will `build to metrics', and that we should therefore keep this in mind as we define our metrics (rather than naively wishing that providers will not `build to metrics'. In all the examples of path performance metrics, Scott noted that there were three important kinds of cases: 1. Measures of the IP path between a pair of specific sites, 2. Measures of the IP path from one specific site to a (conceptually) large number of Internet sites, and 3. Measures of the IP paths among a specific set of sites, as when an organisation uses an IP service as a `virtual private data network'. In a closely related comment, it was noted that Internet exchange points might play a key role in allowing us to characterise IP paths in a way that would allow paths to be composed of two or more IP path segments (e.g., site to exchange point and exchange point to exchange point). In some cases, a metric might have the property that the metric for a complete path could be estimated from metrics of the IP path segments. This would enable interesting methodologies relating to some cases in the previous paragraph. It was also noted that the cost of performing a given measurement should be considered as a criterion. An Initial Set of Needs for Metrics About half of the group's time was devoted to discussing an initial set of needs for metrics. The first set of needs concerns the performance and reliability of paths through the Internet [slide 6]. It was noted that in many cases, paths from a user site to key Internet exchange points would be of great importance. Of the path performance metrics discussed, delay was comparatively easy to measure. Single-application and aggregate flow capacity is more difficult to measure for several reasons: o Tests of flow capacity are hard to do without inducing congestion within the network, thus negatively affecting other users, o The flow capacity of a network depends both on the networks technical qualities and on the current load being placed on the network, o In some cases, users care about the expectation of flow capacity at a typical time; in other cases, they care about near-worst-case flow capacity; thus, measuring flow capacity in the presence of 90th percentile congestion might be useful, and o In some cases (as with distributed interactive simulation) near-real-time flows might need to be sustained, while in others flow capacity measured over several seconds might suffice. Thus, for a number of reasons, flow capacity is more difficult to define and to measure than delay. In addition to congestion affecting any measurement of flow capacity through a network, it was pointed out that a network's robustness in the presence of congestion might be an important property to measure. Reliability is also important but hard to define/measure. In discussions of several of these IP path performance metrics, Scott asked us to consider whether we might include both black box metrics (i.e., viewing the IP path only from the boundaries of IP `clouds') and white box metrics (i.e., viewing also the state of routers and lines within the `cloud'). In discussion, someone mentioned that, of course, any measure that a user could measure from the border of an IP cloud could also be measured by the provider of that cloud. This would be valuable, of course, so that the provider could anticipate and avoid situations in which the performance of the IP cloud would be or appear to be poor as seen by the user. Guy noted that, rather than taking this as an assumption, we should state it as a desirable property of path performance metrics that we design. Gary Malkin observed that we should keep in mind that most users will be new to Internet engineering and of limited technical sophistication. In discussion, it was noted that we would want simple tools usable directly by naive users and more sophisticated tools usable both by sophisticated users, by IP providers, and by organisations functioning as advisers to naive (or sophisticated) users. There was considerable discussion of routing stability [slide 7]. Some felt that the dynamic properties of end-to-end availability, quickness of recovery, and stability were merely technical causes of reliability, and thus not a separate metric in and of themselves. The discussion of DNS performance metrics struck a positive chord [slide 8]. In addition to performance (in the narrow sense) Ruediger Volk suggested that stability and reliability of a DNS service were important. Similarly, NTP service, Web-server service, and NetNews service were thought to be value added services that could be dealt with in a manner similar the treatment of DNS service (assuming that we could do a good job of that). On the other hand, several were pessimistic that useful metrics could arise in the NOC responsiveness area. Administrative Issues At the conclusion of the session, we dealt with several administrative issues [slide 13]: o We agreed that for the present we would operate within the BMWG. o Guy will work to reorganise the IPPM mailing list. The list of those attending the present session will be added to the current IPPM list, and people will then be asked to confirm that they want to be on the list. o Scott Bradner stressed the importance of deliverables, specifically Internet-Drafts that would progress to Informational RFCs. Example topics include: - General metric criteria, and - Descriptions of proposed specific metrics. o Guy said that an IPPM session might be called before the next IETF meeting. Specifically, one might be called to meet in Pittsburgh in early September (adjacent in time to the coming NANOG meeting there).