What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 賊?霽?悠??頭?逗?寥?畯????? 1001000110101111001111111110100011000111001111111001011101001001001111110011111110010011101010100011111110010000100000000011111110011011100011000011111111111011011011110011111100111111001111110011111100111111 91af3fe8c73f97493f3f93aa3f90803f9b8c3ffb6f3f3f3f3f3f
EUC-JP 賊?霽?悠??頭?逗汶寥?畯????? 1100001010110001001111111111000011001001001111111100110110101010001111110011111111000110101011000011111110111111111000001000111111000110111001011101010111101100001111111000111111001101101110110011111100111111001111110011111100111111 c2b13ff0c93fcdaa3f3fc6ac3fbfe08fc6e5d5ec3f8fcdbb3f3f3f3f3f
UTF-8 賊렠霽렢悠꿱렍頭렧逗汶寥렔畯흘렢당렏렕 111010001011001110001010111010111010000010100000111010011001110010111101111010111010000010100010111001101000001010100000111010101011111110110001111010111010000010001101111010011010000010101101111010111010000010100111111010011000000010010111111001101011000110110110111001011010111110100101111010111010000010010100111001111001010110101111111011011001110110011000111010111010000010100010111010111000101110111001111010111010000010001111111010111010000010010101 e8b38aeba0a0e99cbdeba0a2e682a0eabfb1eba08de9a0adeba0a7e98097e6b1b6e5afa5eba094e795afed9d98eba0a2eb8bb9eba08feba095
UHC 賊렠霽렢悠꿱렍頭렧逗汶寥렔畯흘렢당렏렕 1110111011100100100011101011000111110000101110001000111010110011111010101110110110110010111010001000111010100011110101001110100110001110101101101101010011101000110110101010000111101000111011111000111010101001111100011110000111001000111010101000111010110011101101001110011110001110101001011000111010101010 eee48eb1f0b88eb3eaedb2e88ea3d4e98eb6d4e8daa1e8ef8ea9f1e1c8ea8eb3b4e78ea58eaa

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
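
The conversion shown in the table can be approximated offline with a short Python sketch. It is not this page's implementation: the function names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption that may differ from the converter in a few edge-case characters. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("賊").items():
        print(label, bits, hex_string)
```

For the single character 賊, this yields e8b38a for UTF-8, 91af for SJIS-WIN, and c2b1 for EUC-JP, matching the leading bytes of the corresponding rows in the table.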