which naming scheme would give optimal performance on S3?

seenagapeMay 12, 2016

If an application is storing hourly log files from thousands of instances from a high traffic
web site, which naming scheme would give optimal performance on S3?

PrepAway - Latest Free Exam Questions & Answers

A.
Sequential

B.
HH-DD-MM-YYYY-log_instanceID

C.
YYYY-MM-DD-HH-log_instanceID

D.
instanceID_log-HH-DD-MM-YYYY

E.
instanceID_log-YYYY-MM-DD-HH

27 Comments on “which naming scheme would give optimal performance on S3?”

Brian Smith says:

March 25, 2016 at 1:18 am

Probably D

Sandeep says:

May 11, 2016 at 5:23 pm

I agree with D.

Thousands of Instance IDs + Hourly logs seems like the most random sequence option.

Abdul says:

December 29, 2016 at 7:47 pm

Yes, you are correct. You have correct explanation.

0

3

Reply

seenagape says:

May 12, 2016 at 6:15 pm

I choose C

Vijay says:

May 14, 2016 at 10:25 pm

I think B is the correct choice

Martin says:

June 3, 2016 at 6:46 pm

The answer should be B. See http://docs.aws.amazon.com/AmazonS3/latest/dev/request-rate-perf-considerations.html

Balaji says:

June 9, 2016 at 5:47 pm

B looks correct to me,

http://docs.aws.amazon.com/AmazonS3/latest/dev/request-rate-perf-considerations.html

zz says:

June 19, 2016 at 2:33 pm

venkat sai says:

July 8, 2016 at 4:26 am

Yes B is right option. The main reason is the random prefix and the performance would be higher in this case.

A – Don’t make sense
C – YYYY ( This would be same and would be difficult to achieve good performance)
D & E – The instance Id would be same for the first two characters ( i-)

Dev says:

August 3, 2016 at 8:54 am

Ashish Chaturvedi says:

September 24, 2016 at 6:57 am

Niranjana HK says:

September 27, 2016 at 5:52 am

Ankit Shah says:

October 20, 2016 at 7:27 pm

Max says:

November 4, 2016 at 12:53 pm

D. It seems thousands of keys with same prefix “HH-” in one hour is not an optimized performance case.

Duck Bro says:

December 6, 2016 at 4:54 pm

D
Even if the first couple characters are “i-“, the first 3-4 characters provides more random
prefix than HH-DD.

BDA says:

December 28, 2016 at 11:37 am

D , the random hostname prevents hammering a specific partition, and the HH-DD following hostname is more random than E

B will hammer a partition once per day at HH-DD

A changes i/o pattern, does not apply

C is just as bad as A

E is almost as good as D by YYYY will not be as random as D

Ryan says:

December 30, 2016 at 12:44 pm

D is the answer.
A,B,C are all sequential.
E is less random than D.

joe says:

January 25, 2017 at 11:03 am

basant says:

February 20, 2017 at 5:03 am

VK says:

July 5, 2017 at 7:07 pm

C is still sequential. Ans is D

sam says:

August 17, 2017 at 5:05 am

@dynadml says:

October 5, 2017 at 7:07 pm

I think the answer is C because it is anticipated that you will tend to search for logs based on date and time for various instances but the word log should be at the end.

dickloveqdd says:

October 15, 2017 at 10:48 pm

The correct answer is B 参见S3性能优化章节 CDE都是原文的反面教材百分百选B

certified says:

November 10, 2017 at 7:51 am

Anyone who understands how S3 stores data knows that B is the option if you want performance. They key thing to remember here is the more random or changing you can get the prefix to be, the more distributed your objects will be across the stack.

CrazzyFrog says:

December 9, 2017 at 4:59 am

I guess D is correct

PowerCram says:

February 3, 2018 at 1:03 pm

NONE of these answers is correct. In order to partition data stored on S3 the key needs to use one or more slashes (/), therefore the best way in this scenario would be to use _log/YYYY/MM/DD/HH (the order of YY, MM, DD, HH essentially doesn’t matter). This would cause the log file from each instance to be written to a different S3 partition because the instance IDs are unique, therefore they would be an effective hash key.

The way these keys (I.E. file names) are written above they would all be written to the same partition in S3, no matter how the names are jumbled as listed. Effectively there is no difference (performance-wise) among the listed options.

PowerCram says:

February 3, 2018 at 1:05 pm

NONE of these answers is correct. In order to partition data stored on S3 the key needs to use one or more slashes (/), therefore the best way in this scenario would be to use instanceID_log/YYYY/MM/DD/HH (the order of YY, MM, DD, HH essentially doesn’t matter). This would cause the log file from each instance to be written to a different S3 partition because the instance IDs are unique, therefore they would be an effective hash key.

(Had to repost because “instanceID” isn’t displayed.)

Get 50% Discount on All Your Purchases
at PrepAway.com - Latest Exam Questions

This is ONE TIME OFFER

Enter your email address to receive your 50% off dicount code:

SPECIAL OFFER: GET 50% OFF

Use Discount Code:

Briefing Amazon Knowledge

Free Amazon study guide

which naming scheme would give optimal performance on S3?

27 Comments on “which naming scheme would give optimal performance on S3?”

Leave a Reply Cancel reply