To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霓????霓????B 11101000101111010011111100111111001111110011111111101000101111010011111100111111001111110011111101000010 e8bd3f3f3f3fe8bd3f3f3f3f42
EUC-JP 霓????霓????B 11110000101111110011111100111111001111110011111111110000101111110011111100111111001111110011111101000010 f0bf3f3f3f3ff0bf3f3f3f3f42
UTF-8 霓ㅻ뀡泥몎霓ㅻ뀡泥몎B 11101001100111001001001111100011100001011011101111101011100000001010000111101111101001111010001111101011101010101000111011101001100111001001001111100011100001011011101111101011100000001010000111101111101001111010001111101011101010101000111001000010 e99c93e385bbeb80a1efa7a3ebaa8ee99c93e385bbeb80a1efa7a3ebaa8e42
UHC 霓ㅻ뀡泥몎霓ㅻ뀡泥몎B 111001111110011110100100111010111000010110011000111011001011001010010001011101101110011111100111101001001110101110000101100110001110110010110010100100010111011001000010 e7e7a4eb8598ecb29176e7e7a4eb8598ecb2917642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)