Which bit string is a character encoded as in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???瘟??葬??娃 00111111001111110011111111100001100010010011111100111111100100011001001000111111001111111000100010100001 3f3f3fe1893f3f91923f3f88a1
EUC-JP ???瘟??葬??娃 00111111001111110011111111100001111010010011111100111111110000011111001000111111001111111011000010100011 3f3f3fe1e93f3fc1f23f3fb0a3
UTF-8 筽몌슝瘟욜돉葬섋걀娃 111001111010110110111101111010111010101010001100111011001000101010011101111001111001100010011111111011001001101010011100111010111000111110001001111010001001000110101100111011001000010010001011111010101011000110000000111001011010100010000011 e7adbdebaa8cec8a9de7989fec9a9ceb8f89e891acec848beab180e5a883
UHC 筽몌슝瘟욜돉葬섋걀娃 1110100010100100101110001110111110111101101110011110100010110000101111111110011110001001100110011110110111110111100110001110100010110000101111111110100011011111 e8a4b8efbdb9e8b0bfe78999edf798e8b0bfe8df

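SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (byte 3f) in the table above.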
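
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names are assumptions (SJIS-WIN is mapped to `cp932` as noted above, and UHC is assumed to correspond to `cp949`). Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption based on SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For example, the hiragana "あ" comes out as e38182 in UTF-8 and 82a0 in SJIS-WIN, while ISO-8859-1 cannot represent it and yields 3f.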