To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴉??溢??喩????????誘り?亦?? 111010011110101100111111001111111000100011101100001111110011111110011010011001110011111100111111001111110011111100111111001111110011111100111111100101110101010110000010111010000011111110010110100100100011111100111111 e9eb3f3f88ec3f3f9a673f3f3f3f3f3f3f3f975582e83f96923f3f
EUC-JP 鴉??溢??喩??饔??嫄??誘り?亦?? 11110010111011010011111100111111101100001110111000111111001111111101001111001000001111110011111110001111111010001110111100111111001111111000111110111010101000010011111100111111110011011011011010100100111010100011111111001011111100100011111100111111 f2ed3f3fb0ee3f3fd3c83f3f8fe8ef3f3f8fbaa13f3fcdb6a4ea3fcbf23f3f
UTF-8 鴉띾콈溢숅윀喩붿뗀饔끸뮧嫄숋쫩誘り텖亦낆슞 111010011011010010001001111010111001110110111110111011001011110110001000111001101011101010100010111011001000100010000101111011001001110010000000111001011001011010101001111010111011011010111111111010111001011110000000111010011010010110010100111010111000000110111000111010111010111010100111111001011010101110000100111011001000100010001011111011001010101110101001111010001010101010011000111000111000001010001010111011011000010110010110111001001011101010100110111010111000001010000110111011001000101010011110 e9b489eb9dbeecbd88e6baa2ec8885ec9c80e596a9ebb6bfeb9780e9a594eb81b8ebaea7e5ab84ec888becaba9e8aa98e3828aed8596e4baa6eb8286ec8a9e
UHC 鴉띾콈溢숅윀喩붿뗀饔끸뮧嫄숋쫩誘り텖亦낆슞 111001001011110010001101111010111011000110000100111011001110111010011001111010011001111110001011111010101110011110010100111011001011011010111110111010001011110110000101111000101001001010110010111010101011000110011001111011111010011010000010111010111010111110101010111010101011011010001111111001101011001010000101111011001001101010101010 e4bc8debb184ecee99e99f8beae794ecb6bee8bd85e292b2eab199efa682ebafaaeab68fe6b285ec9aaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)