What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 塋よ?蘖??弱??塋よ?蘖??弱??B 100110101100100010000010111001100011111110011111010100000011111100111111100011101110001100111111001111111001101011001000100000101110011000111111100111110101000000111111001111111000111011100011001111110011111101000010 9ac882e63f9f503f3f8ee33f3f9ac882e63f9f503f3f8ee33f3f42
EUC-JP 塋よ?蘖??弱??塋よ?蘖??弱??B 110101001100101010100100111010000011111111011101101100010011111100111111101111001110010100111111001111111101010011001010101001001110100000111111110111011011000100111111001111111011110011100101001111110011111101000010 d4caa4e83fddb13f3fbce53f3fd4caa4e83fddb13f3fbce53f3f42
UTF-8 塋よ쥤蘖띹꽦弱딀뜆塋よ쥤蘖띹꽦弱딀뜆B 11100101101000011000101111100011100000101000100011101100101001011010010011101000100110001001011011101011100111011011100111101010101111011010011011100101101111001011000111101011100101001000000011101011100111001000011011100101101000011000101111100011100000101000100011101100101001011010010011101000100110001001011011101011100111011011100111101010101111011010011011100101101111001011000111101011100101001000000011101011100111001000011001000010 e5a18be38288eca5a4e89896eb9db9eabda6e5bcb1eb9480eb9c86e5a18be38288eca5a4e89896eb9db9eabda6e5bcb1eb9480eb9c8642
UHC 塋よ쥤蘖띹꽦弱딀뜆塋よ쥤蘖띹꽦弱딀뜆B 11100111101010111010101011101000101000101001011011100101111011101000110111101000100001001011000111100101101100001000101011100110100011011000100111100111101010111010101011101000101000101001011011100101111011101000110111101000100001001011000111100101101100001000101011100110100011011000100101000010 e7abaae8a296e5ee8de884b1e5b08ae68d89e7abaae8a296e5ee8de884b1e5b08ae68d8942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
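
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the page's own implementation. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the table does as well.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_row(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_table("B").items():
        print(label, bits, hex_digits)
```

Calling `encode_table` on a short string yields one row per charset, in the same column order as the table (binary first, then hexadecimal).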