To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蝪シ遘≪苴謠蝉ソク嶸 1110010110100001101111001110011110100111100000011110000111100100100100111110011010001111100100001110010010111111101110001111101010110100 e5a1bce7a781e1e493e68f90e4bfb8fab4
EUC-JP 蝪シ遘≪苴謠蝉ソク嶸 111010101010001110001110101111001110111010101001101000101110001111100111111100111110101111101111110000001110011010001110101111111000111010111000100011111011101111110100 eaa38ebceea9a2e3e7f3ebefc0e68ebf8eb88fbbf4
UTF-8 蝪シ遘≪苴謠蝉ソク嶸 111010001001110110101010111011111011110110111100111010011000000110011000111000101000100110101010111010001000101110110100111010001010110010100000111010001001110110001001111011111011110110111111111011111011110110111000111001011011011010111000 e89daaefbdbce98198e289aae88bb4e8aca0e89d89efbdbfefbdb8e5b6b8
UHC ???≪?謠???嶸 00111111001111110011111110100001111011000011111111101001101010100011111100111111001111111110011110101110 3f3f3fa1ec3fe9aa3f3f3fe7ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)