What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN ???押??鈺?}v???押??鈺?}vB 00111111001111110011111110001001100111110011111100111111111110111100010000111111011111010111011000111111001111110011111110001001100111110011111100111111111110111100010000111111011111010111011001000010 3f3f3f899f3f3ffbc43f7d763f3f3f899f3f3ffbc43f7d7642
EUC-JP 旿??押??鈺?}v旿??押??鈺?}vB 10001111110000011111010000111111001111111011001010100001001111110011111110001111111000111101010100111111011111010111011010001111110000011111010000111111001111111011001010100001001111110011111110001111111000111101010100111111011111010111011001000010 8fc1f43f3fb2a13f3f8fe3d53f7d768fc1f43f3fb2a13f3f8fe3d53f7d7642
UTF-8 旿닷눎押뜹쪠鈺폝}v旿닷눎押뜹쪠鈺폝}vB 1110011010010111101111111110101110001011101101111110101110001000100011101110011010001010101111001110101110011100101110011110110010101010101000001110100110001000101110101110110110001111100111010111110101110110111001101001011110111111111010111000101110110111111010111000100010001110111001101000101010111100111010111001110010111001111011001010101010100000111010011000100010111010111011011000111110011101011111010111011001000010 e697bfeb8bb7eb888ee68abceb9cb9ecaaa0e988baed8f9d7d76e697bfeb8bb7eb888ee68abceb9cb9ecaaa0e988baed8f9d7d7642
UHC 旿닷눎押뜹쪠鈺폝}v旿닷눎押뜹쪠鈺폝}vB 11100111111110101011010011100101100001111010101011100100111000111011011011100101101001011001100111101000101011011011110101000110011111010111011011100111111110101011010011100101100001111010101011100100111000111011011011100101101001011001100111101000101011011011110101000110011111010111011001000010 e7fab4e587aae4e3b6e5a599e8adbd467d76e7fab4e587aae4e3b6e5a599e8adbd467d7642

SJIS-win, EUC-JP: Legacy character encodings for Japanese, used mainly on Windows (SJIS-win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
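
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the function name encode_table and the mapping from the charset labels to Python codec names (cp932 for SJIS-win, cp949 for UHC, and so on) are assumptions made for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which matches what the table appears to show.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return one (label, binary string, hex string) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("押}vB"):
        print(label, bits, hex_string)
```

Each byte is always rendered as eight binary digits, so the binary column is exactly four times as long as the hexadecimal column.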