To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 楷捧?章??楷??陋[楷捧?章??楷??陋[^ 100111101011001010010101111110010011111110001111110011010011111100111111100111101011001000111111001111111110100010011011010110111001111010110010100101011111100100111111100011111100110100111111001111111001111010110010001111110011111111101000100110110101101101011110 9eb295f93f8fcd3f3f9eb23f3fe89b5b9eb295f93f8fcd3f3f9eb23f3fe89b5b5e
EUC-JP 楷捧?章??楷?瀣陋[楷捧?章??楷?瀣陋[^ 11011100101101001100101011111011001111111011111011001111001111110011111111011100101101000011111110001111110010011011000111101111111110110101101111011100101101001100101011111011001111111011111011001111001111110011111111011100101101000011111110001111110010011011000111101111111110110101101101011110 dcb4cafb3fbecf3f3fdcb43f8fc9b1effb5bdcb4cafb3fbecf3f3fdcb43f8fc9b1effb5b5e
UTF-8 楷捧딘章累땄楷렜瀣陋[楷捧딘章累땄楷렜瀣陋[^ 111001101010010110110111111001101000110110100111111010111001010010011000111001111010101110100000111011111010010110001111111010111001010110000100111001101010010110110111111010111010000010011100111001111000000010100011111010011001100110001011010110111110011010100101101101111110011010001101101001111110101110010100100110001110011110101011101000001110111110100101100011111110101110010101100001001110011010100101101101111110101110100000100111001110011110000000101000111110100110011001100010110101101101011110 e6a5b7e68da7eb9498e7aba0efa58feb9584e6a5b7eba09ce780a3e9998b5be6a5b7e68da7eb9498e7aba0efa58feb9584e6a5b7eba09ce780a3e9998b5b5e
UHC 楷捧딘章累땄楷렜瀣陋[楷捧딘章累땄楷렜瀣陋[^ 11111010101011001101110011101001101101011111001011101101111100011101001011101001101101101010010011111010101011001000111010101110111110101010111011010111101100000101101111111010101011001101110011101001101101011111001011101101111100011101001011101001101101101010010011111010101011001000111010101110111110101010111011010111101100000101101101011110 faacdce9b5f2edf1d2e9b6a4faac8eaefaaed7b05bfaacdce9b5f2edf1d2e9b6a4faac8eaefaaed7b05b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)