What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 底??底??嚥??底 1001001011101010001111110011111110010010111010100011111100111111100110101000101100111111001111111001001011101010 92ea3f3f92ea3f3f9a8b3f3f92ea
EUC-JP 底??底??嚥??底 1100010011101100001111110011111111000100111011000011111100111111110100111110101100111111001111111100010011101100 c4ec3f3fc4ec3f3fd3eb3f3fc4ec
UTF-8 底쇽스底억슝嚥드ㅁ底 111001011011101010010101111011001000011110111101111011001000101010100100111001011011101010010101111011001001011010110101111011001000101010011101111001011001101010100101111010111001001110011100111000111000010110000001111001011011101010010101 e5ba95ec87bdec8aa4e5ba95ec96b5ec8a9de59aa5eb939ce38581e5ba95
UHC 底쇽스底억슝嚥드ㅁ底 1110111010111100101111001110111110111101101110101110111010111100101111101110111110111101101110011110011010111111101101011110010110100100101100011110111010111100 eebcbcefbdbaeebcbeefbdb9e6bfb5e5a4b1eebc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
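
A conversion like the one in the table can be sketched in Python as follows. This is only an illustration, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC), the helper names `encode_row` and `convert`, and the choice to replace unencodable characters with `?` (0x3f) are all assumptions.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("底").items():
        print(label, bits, hexstr)
```

With the single character 底 as input, the UTF-8 row comes out as three bytes and the SJIS-WIN and EUC-JP rows as two bytes each, while ISO-8859-1 cannot represent the character and falls back to a single `?`.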