[Xen-users] Get some useful metrics out of tmem-list
After having upgraded both hypervisor and domU kernels we're now actually using tmem, and we can see all those little numbers going up and down in the output of xl tmem-list -la | xen-tmem-list-parse.

Now we'd like to extract some useful data from those numbers, to try to answer questions like:
- how much tmem are we using?
- would we benefit from having more?
- how much are we gaining from compression? and from dedup?
- and so on

I'm mostly interested in global data of the host and not per-domain specific usage of tmem, and I want to write a script to extract that data so you can follow it with your preferred monitoring system (as an example, I'm gonna graph that data with Cacti).

The first line of output is:

total tmem ops=9057925 (errors=539921) -- tmem pages avail=90958

I'm guessing total ops is the sum of all get/put/etc... operations initiated by tmem consumers (xen guests actually), so it should give me an idea of the overall activity on tmem. Does it count all operations (even ones that result in errors) or only successful ones?

I'm also guessing errors is (mostly?) failed gets because the page got evicted and failed puts because there isn't space available, so I could infer that continuously incrementing errors should tell me I would benefit from having more tmem available.

Is pages avail the "free ram" in the hypervisor that could be used for tmem (i.e., if I don't start new guests or enlarge running ones)? If so, I would expect that number to decrease after some runtime and stay low unless I kill some running domain, because using cleancache as the only tmem consumer should rarely free up pages.
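To make the idea concrete, here is a rough sketch of how that first line could be pulled apart for a monitoring script. It only assumes the line looks exactly like the sample above; the names parse_totals and error_rate are made up for illustration, and whether "ops" already includes the failed operations is still an open question, so the ratio is only a rough indicator.

```python
import re

# Sample line copied verbatim from the output quoted above.
SAMPLE = "total tmem ops=9057925 (errors=539921) -- tmem pages avail=90958"

# Assumption: the field names and their order match the sample line;
# other Xen versions may print this line differently.
_TOTALS_RE = re.compile(
    r"total tmem ops=(?P<ops>\d+)\s+\(errors=(?P<errors>\d+)\)"
    r".*?pages avail=(?P<pages_avail>\d+)"
)


def parse_totals(line):
    """Return the three global counters from the first output line as ints."""
    match = _TOTALS_RE.search(line)
    if match is None:
        raise ValueError("unrecognized totals line: %r" % line)
    return {name: int(value) for name, value in match.groupdict().items()}


def error_rate(prev, curr):
    """Errors per operation between two samples (hypothetical helper).

    The counters only ever grow, so a trend needs the difference between
    two readings rather than the absolute values.
    """
    delta_ops = curr["ops"] - prev["ops"]
    delta_errors = curr["errors"] - prev["errors"]
    if delta_ops <= 0:
        return 0.0
    return delta_errors / delta_ops


if __name__ == "__main__":
    print(parse_totals(SAMPLE))
```

A poller would call parse_totals on each sample, keep the previous result around, and hand the deltas (or the raw counters, for a COUNTER-style data source) to the graphing tool.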
The other two lines of output are:

datastructs: objs=5775 (max=15260) pgps=85275 (max=186716) nodes=5901 (max=12744) pages=35501 (max=100026) pcds=84621 (max=184873) deduped: avg=1.38% (curr=0.77%) compression savings=30.09%
misc: failed_copies=0 alloc_failed=15416 alloc_page_failed=0 low_mem=0 evicted=0/0 relinq=0/0, max_evicts_per_relinq=0, flush_pools=0, eph_count=85275, eph_max=186716

I guess those numbers will tell me some things like tmem usage (used / free / available?) and how much I'm benefiting from compression and dedup (the % values in the datastructs line?).

Can someone confirm / correct my assumptions and fill the voids?

Are those 'pages' fixed size ones? If that's the case, what's the page size on x86_64?

Is the "pages" number the current tmem usage of actual ram? Can I assume that means it contains pages * (1+deduped.curr) * (1+compression_savings) data?

thanks,
--
Luca Lesinigo

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxx
http://lists.xen.org/xen-users