What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | ???遊??幽??億 | 00111111001111110011111110010111010101100011111100111111100101110100100000111111001111111000100110101101 | 3f3f3f97563f3f97483f3f89ad
EUC-JP | ???遊??幽??億 | 00111111001111110011111111001101101101110011111100111111110011011010100100111111001111111011001010101111 | 3f3f3fcdb73f3fcda93f3fb2af
UTF-8 | 料곗럩遊뷸첀幽덉쨭億 | 111011111010011010111110111010101011001110010111111010111001111110101001111010011000000110001010111010111011011110111000111011001011001010000000111001011011100110111101111010111000110110001001111011001010100010101101111001011000010010000100 | efa6beeab397eb9fa9e9818aebb7b8ecb280e5b9bdeb8d89eca8ade58484
UHC | 料곗럩遊뷸첀幽덉쨭億 | 1110100011110111101100001110110010001110100011001110101110110100101110101110011010101010100011011110101011101011100010001110110010100100100001111110010111100010 | e8f7b0ec8e8cebb4bae6aa8deaeb88eca487e5e2

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
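
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The function names `to_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) is an assumption, so some characters may map differently than in the table. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behavior the ISO-8859-1 row suggests.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with codec; return (binary string, hex string)."""
    # errors="replace" turns every unencodable character into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\u904a" is the character 遊, which appears in the table above.
    for label, (binary, hexa) in convert("\u904a").items():
        print(label, binary, hexa)
```

For the single character 遊, the UTF-8 entry is `e9818a` and the ISO-8859-1 entry is `3f`, consistent with the corresponding bytes in the table.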