What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄μ?悠??猿??歪??淫??碎??筌?(異 1001011011101111100000111100101000111111100101110100100100111111001111111000100110001110001111110011111110011000011000110011111100111111100010001111101000111111001111111110000111101010001111110011111111100010101000110011111110000001011010011000100011011001 96ef83ca3f97493f3f898e3f3f98633f3f88fa3f3fe1ea3f3fe2a33f816988d9
EUC-JP 厄μ?悠??猿??歪??淫??碎??筌?(異 1100110011110001101001101100110000111111110011011010101000111111001111111011000111101110001111110011111111001111110001000011111100111111101100001111110000111111001111111110001011101100001111110011111111100100101001010011111110100001110010101011000011011011 ccf1a6cc3fcdaa3f3fb1ee3f3fcfc43f3fb0fc3f3fe2ec3f3fe4a53fa1cab0db
UTF-8 厄μ떜悠껅궇猿뉎걶歪묅뫕淫앯쥗碎밸듋筌앸(異 1110010110001110100001001100111010111100111010111001011010011100111001101000001010100000111010101011101110000101111010101011011010000111111001111000110010111111111010111000100110001110111010101011000110110110111001101010110110101010111010111010110010000101111010111010101110010101111001101011011110101011111011001001010110101111111011001010010110010111111001111010001010001110111010111011000010111000111010111001001110001011111001111010110110001100111011001001010110111000111011111011110010001000111001111001010110110000 e58e84cebceb969ce682a0eabb85eab687e78cbfeb898eeab1b6e6adaaebac85ebab95e6b7abec95afeca597e7a28eebb0b8eb938be7ad8cec95b8efbc88e795b0
UHC 厄μ떜悠껅궇猿뉎걶歪묅뫕淫앯쥗碎밸듋筌앸(異 1110010011111000101001011110110010001011101100101110101011101101100000111110011010000010101000001110101010111011100001111110001110000001100111001110100011100000100100011110001010010001101101111110101111100010100111011110011110100010100011011110000111101111101110011110101110001010101111101110111110100111100111011110101110100011101010001110110010110110 e4f8a5ec8bb2eaed83e682a0eabb87e3819ce8e091e291b7ebe29de7a28de1efb9eb8abeefa79deba3a8ecb6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
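
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the helper name to_bit_strings is hypothetical, and the mapping from the table's charset labels to Python codec names is an assumption (cp932 for SJIS-WIN, as the note above states; cp949 for UHC). It also assumes that characters a charset cannot represent are replaced by "?" (byte 3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of text encoded with codec."""
    # errors="replace" emits "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("異", codec)
    print(label, binary, hexadecimal)
```

For the single character 異, this yields e795b0 in UTF-8, 88d9 in SJIS-WIN, and b0db in EUC-JP, matching the final bytes of the corresponding rows above, and 3f in ISO-8859-1, which has no such character.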