What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 縡????棒??趙貊?肯??窪敎?棒??趙貊?發h 111000110111000100111111001111110011111100111111100101100101111100111111001111111110011011100010111001101011101100111111100011010110110100111111001111111000110001000101111110101100110100111111100101100101111100111111001111111110011011100010111001101011101100111111111000011010001001101000 e3713f3f3f3f965f3f3fe6e2e6bb3f8d6d3f3f8c45facd3f965f3f3fe6e2e6bb3fe1a268
EUC-JP 縡?勖??棒??趙貊?肯??窪??棒??趙貊?發h 11100101110100100011111110001111101100111110110100111111001111111100101111000000001111110011111111101100111001001110110010111101001111111011100111001110001111110011111110110111101001100011111100111111110010111100000000111111001111111110110011100100111011001011110100111111111000101010010001101000 e5d23f8fb3ed3f3fcbc03f3fece4ecbd3fb9ce3f3fb7a63f3fcbc03f3fece4ecbd3fe2a468
UTF-8 縡렕勖쾅렠棒렕렟趙貊렚肯렖렕窪敎렠棒렕렟趙貊렠發h 11100111101110001010000111101011101000001001010111100101100010111001011011101100101111101000010111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111101000101101101001100111101000101100101000101011101011101000001001101011101000100000101010111111101011101000001001011011101011101000001001010111100111101010101010101011100110100101011000111011101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111101000101101101001100111101000101100101000101011101011101000001010000011100111100110011011110001101000 e7b8a1eba095e58b96ecbe85eba0a0e6a392eba095eba09fe8b699e8b28aeba09ae882afeba096eba095e7aaaae6958eeba0a0e6a392eba095eba09fe8b699e8b28aeba0a0e799bc68
UHC 縡렕勖쾅렠棒렕렟趙貊렚肯렖렕窪敎렠棒렕렟趙貊렠發h 11101110101011011000111010101010111010011110110111000100111001111000111010110001110111001110101010001110101010101000111010110000111100001110000111011000111001111000111010101101110100001110100110001110101010111000111010101010111010001100000111001110111001111000111010110001110111001110101010001110101010101000111010110000111100001110000111011000111001111000111010110001110110111010000101101000 eead8eaae9edc4e78eb1dcea8eaa8eb0f0e1d8e78eadd0e98eab8eaae8c1cee78eb1dcea8eaa8eb0f0e1d8e78eb1dba168

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
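
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function name bit_strings is hypothetical, and the Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed to be close equivalents of the charsets listed, so a few characters may map differently. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f in the table.

```python
def bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset.

    Characters the charset cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits and concatenate them.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed Python codec names for the charsets in the table above.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "h"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Passing a longer string in place of "h" prints one row per charset in the same column order as the table.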