To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 逆???億??宜 | 1000101101110100001111110011111100111111100010011010110100111111001111111000101101011000 | 8b743f3f3f89ad3f3f8b58
EUC-JP | 逆???億??宜 | 1011010111010101001111110011111100111111101100101010111100111111001111111011010110111001 | b5d53f3f3fb2af3f3fb5b9
UTF-8 | 逆곷봿횞億됰굙宜 | 111010011000000010000110111010101011001110110111111010111011010010111111111011011001101010011110111001011000010010000100111010111001000010110000111010101011010110011001111001011010111010011100 | e98086eab3b7ebb4bfed9a9ee58484eb90b0eab599e5ae9c
UHC | 逆곷봿횞億됰굙宜 | 11100110101111011000000111101011100101001000011011000011100101111110010111100010100010011110101110000010100000011110101111110001 | e6bd81eb9486c397e5e289eb8281ebf1
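
Rows like the ones above can be approximated with a short script. The following is a minimal sketch in Python, not the converter's own code. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper name encode_row is hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the question marks in the Character column.

```python
# Charset labels from the table, mapped to Python codec names.
# The mapping is an assumption; the converter may use a different library.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for text in codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec)                      # what survives the round trip
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per encoded byte
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "逆곷봿횞億됰굙宜"
    for label, codec in CODECS.items():
        shown, bits, hex_str = encode_row(sample, codec)
        print(label, shown, bits, hex_str)
```

The binary column is simply each encoded byte written as eight bits, so its length is always eight times the number of bytes and four times the length of the hexadecimal column.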

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).