To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 賊?弔???伊豆?矜?伊逗?瓏?貊??游? 10010001101011110011111110010010101000100011111100111111001111111000100011001001100100111010010000111111111000011110000000111111100010001100100110010000100000000011111111100000111110100011111111100110101110110011111100111111100111111110000000111111 91af3f92a23f3f3f88c993a43fe1e03f88c990803fe0fa3fe6bb3f3f9fe03f
EUC-JP 賊?弔???伊豆?矜?伊逗?瓏?貊??游? 11000010101100010011111111000100101001000011111100111111001111111011000011001011110001101010011000111111111000101110001000111111101100001100101110111111111000000011111111100000111111000011111111101100101111010011111100111111110111101110001000111111 c2b13fc4a43f3f3fb0cbc6a63fe2e23fb0cbbfe03fe0fc3fecbd3f3fdee23f
UTF-8 賊렠弔렟罹렗伊豆렚矜썬伊逗렫瓏롛貊렏렕游쯔 111010001011001110001010111010111010000010100000111001011011110010010100111010111010000010011111111011111010011110100110111010111010000010010111111001001011110010001010111010001011000110000110111010111010000010011010111001111001111110011100111011001000110110101100111001001011110010001010111010011000000010010111111010111010000010101011111001111001001110001111111010111010000110011011111010001011001010001010111010111010000010001111111010111010000010010101111001101011100010111000111011001010111110010100 e8b38aeba0a0e5bc94eba09fefa7a6eba097e4bc8ae8b186eba09ae79f9cec8dace4bc8ae98097eba0abe7938feba19be8b28aeba08feba095e6b8b8ecaf94
UHC 賊렠弔렟罹렗伊豆렚矜썬伊逗렫瓏롛貊렏렕游쯔 111011101110010010001110101100011111000011000000100011101011000011101100101110101000111010101100111011001010010111010100111001111000111010101101110100001110100010111101111000111110110010100101110101001110100010001110101110011101011011101010100011101101111111011000111001111000111010100101100011101010101011101010111111011100001011101010 eee48eb1f0c08eb0ecba8eaceca5d4e78eadd0e8bde3eca5d4e88eb9d6ea8edfd8e78ea58eaaeafdc2ea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)