To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???b????? 001111110011111100111111011000100011111100111111001111110011111100111111 3f3f3f623f3f3f3f3f
SJIS-WIN 菽??b菽??菽? 111001001100000100111111001111110110001011100100110000010011111100111111111001001100000100111111 e4c13f3f62e4c13f3fe4c13f
EUC-JP 菽??b菽??菽? 111010001100001100111111001111110110001011101000110000110011111100111111111010001100001100111111 e8c33f3f62e8c33f3fe8c33f
UTF-8 菽됦돶b菽됦돶菽됥 11101000100011111011110111101011100100001010011011101011100011111011011001100010111010001000111110111101111010111001000010100110111010111000111110110110111010001000111110111101111010111001000010100101 e88fbdeb90a6eb8fb662e88fbdeb90a6eb8fb6e88fbdeb90a5
UHC 菽됦돶b菽됦돶菽됥 1110001011011101100010011110010010001001101110010110001011100010110111011000100111100100100010011011100111100010110111011000100111100011 e2dd89e489b962e2dd89e489b9e2dd89e3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)