To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????b[?????????b[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001001011011001111110011111100111111001111110011111100111111001111110011111100111111011000100101101101011110 3f3f3f3f3f3f3f3f3f625b3f3f3f3f3f3f3f3f3f625b5e
SJIS-WIN 瓮??鈺??梧??b[瓮??鈺??梧??b[^ 1110000101000100001111110011111111111011110001000011111100111111100011001110011000111111001111110110001001011011111000010100010000111111001111111111101111000100001111110011111110001100111001100011111100111111011000100101101101011110 e1443f3ffbc43f3f8ce63f3f625be1443f3ffbc43f3f8ce63f3f625b5e
EUC-JP 瓮??鈺??梧??b[瓮??鈺??梧??b[^ 11100001101001010011111100111111100011111110001111010101001111110011111110111000111010000011111100111111011000100101101111100001101001010011111100111111100011111110001111010101001111110011111110111000111010000011111100111111011000100101101101011110 e1a53f3f8fe3d53f3fb8e83f3f625be1a53f3f8fe3d53f3fb8e83f3f625b5e
UTF-8 瓮뚳슛鈺뚳쉐梧삥쑇b[瓮뚳슛鈺뚳쉐梧삥쑇b[^ 1110011110010011101011101110101110011010101100111110110010001010100110111110100110001000101110101110101110011010101100111110110010001001100100001110011010100010101001111110110010000010101001011110110010010001100001110110001001011011111001111001001110101110111010111001101010110011111011001000101010011011111010011000100010111010111010111001101010110011111011001000100110010000111001101010001010100111111011001000001010100101111011001001000110000111011000100101101101011110 e793aeeb9ab3ec8a9be988baeb9ab3ec8990e6a2a7ec82a5ec9187625be793aeeb9ab3ec8a9be988baeb9ab3ec8990e6a2a7ec82a5ec9187625b5e
UHC 瓮뚳슛鈺뚳쉐梧삥쑇b[瓮뚳슛鈺뚳쉐梧삥쑇b[^ 1110100010110111100011001110111110111101101110001110100010101101100011001110111110111101101001101110011111111100101110111110011010011100101001110110001001011011111010001011011110001100111011111011110110111000111010001010110110001100111011111011110110100110111001111111110010111011111001101001110010100111011000100101101101011110 e8b78cefbdb8e8ad8cefbda6e7fcbbe69ca7625be8b78cefbdb8e8ad8cefbda6e7fcbbe69ca7625b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
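
A comparable conversion can be sketched in Python. This is a minimal illustration, not the code behind this page: the codec names on the right-hand side of CHARSETS are assumed Python equivalents of the charset labels in the table (in particular cp932 for SJIS-WIN and cp949 for UHC), and errors="replace" is assumed to match the "?" (0x3f) substitution seen above for characters a charset cannot represent. The function name encode_table is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        # Each byte is written as 8 binary digits, most significant first.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("b[^"):
        print(label, binary, hexadecimal)
```

ASCII characters such as "b[^" produce the same bytes (625b5e) in all five charsets, which is why every row of the table ends with the same bit pattern. Results for other characters may differ from this page where the codec implementations disagree.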