To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚴??誼??邑??? 10011010100011100011111100111111100010110110001000111111001111111001011101010111001111110011111100111111 9a8e3f3f8b623f3f97573f3f3f
EUC-JP 嚴??誼??邑??? 11010011111011100011111100111111101101011100001100111111001111111100110110111000001111110011111100111111 d3ee3f3fb5c33f3fcdb83f3f3f
UTF-8 嚴곷끂誼뷴▶邑뀁젴咽 111001011001101010110100111010101011001110110111111010111000000110000010111010001010101010111100111010111011011110110100111000101001011010110110111010011000001010010001111010111000000010000001111011001010000010110100111011111010011010011110 e59ab4eab3b7eb8182e8aabcebb7b4e296b6e98291eb8081eca0b4efa69e
UHC 嚴곷끂誼뷴▶邑뀁젴咽 1110010111110001100000011110101110000101101110001110101111111110101110101110010110100010101110101110101111101001101100101110110010100000101010001110011011101100 e5f181eb85b8ebfebae5a2baebe9b2eca0a8e6ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)