To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????}v??????}vB 0011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f7d763f3f3f3f3f3f7d7642
SJIS-WIN 鄒」轤頑エェ}v鄒」轤頑エェ}vB 1110011110111110101000111110011110000010100010101110011010110100101010100111110101110110111001111011111010100011111001111000001010001010111001101011010010101010011111010111011001000010 e7bea3e7828ae6b4aa7d76e7bea3e7828ae6b4aa7d7642
EUC-JP 鄒」轤頑エェ}v鄒」轤頑エェ}vB 1110111011000000100011101010001111101101111000101011010011101000100011101011010010001110101010100111110101110110111011101100000010001110101000111110110111100010101101001110100010001110101101001000111010101010011111010111011001000010 eec08ea3ede2b4e88eb48eaa7d76eec08ea3ede2b4e88eb48eaa7d7642
UTF-8 鄒」轤頑エェ}v鄒」轤頑エェ}vB 1110100110000100100100101110111110111101101000111110100010111101101001001110100110100000100100011110111110111101101101001110111110111101101010100111110101110110111010011000010010010010111011111011110110100011111010001011110110100100111010011010000010010001111011111011110110110100111011111011110110101010011111010111011001000010 e98492efbda3e8bda4e9a091efbdb4efbdaa7d76e98492efbda3e8bda4e9a091efbdb4efbdaa7d7642
UHC 鄒??頑??}v鄒??頑??}vB 111101011101101100111111001111111110100011010111001111110011111101111101011101101111010111011011001111110011111111101000110101110011111100111111011111010111011001000010 f5db3f3fe8d73f3f7d76f5db3f3fe8d73f3f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)