What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 賊?寃??貊????低大賊?寃??貊????低垈^ 1001000110101111001111111001101110000011001111110011111111100110101110110011111100111111001111110011111110010010111000011001000111100101100100011010111100111111100110111000001100111111001111111110011010111011001111110011111100111111001111111001001011100001100110101011000001011110 91af3f9b833f3fe6bb3f3f3f3f92e191e591af3f9b833f3fe6bb3f3f3f3f92e19ab05e
EUC-JP 賊?寃??貊????低大賊?寃??貊????低垈^ 1100001010110001001111111101010111100011001111110011111111101100101111010011111100111111001111110011111111000100111000111100001011100111110000101011000100111111110101011110001100111111001111111110110010111101001111110011111100111111001111111100010011100011110101001011001001011110 c2b13fd5e33f3fecbd3f3f3f3fc4e3c2e7c2b13fd5e33f3fecbd3f3f3f3fc4e3d4b25e
UTF-8 賊렠寃닿롛貊렩렰罹렗低大賊렠寃닿롛貊렩렰罹렗低垈^ 11101000101100111000101011101011101000001010000011100101101011111000001111101011100010111011111111101011101000011001101111101000101100101000101011101011101000001010100111101011101000001011000011101111101001111010011011101011101000001001011111100100101111011000111011100101101001001010011111101000101100111000101011101011101000001010000011100101101011111000001111101011100010111011111111101011101000011001101111101000101100101000101011101011101000001010100111101011101000001011000011101111101001111010011011101011101000001001011111100100101111011000111011100101100111101000100001011110 e8b38aeba0a0e5af83eb8bbfeba19be8b28aeba0a9eba0b0efa7a6eba097e4bd8ee5a4a7e8b38aeba0a0e5af83eb8bbfeba19be8b28aeba0a9eba0b0efa7a6eba097e4bd8ee59e885e
UHC 賊렠寃닿롛貊렩렰罹렗低大賊렠寃닿롛貊렩렰罹렗低垈^ 11101110111001001000111010110001111010101011001010110100111010101000111011011111110110001110011110001110101101111000111010111101111011001011101010001110101011001110111010111000110100111101111011101110111001001000111010110001111010101011001010110100111010101000111011011111110110001110011110001110101101111000111010111101111011001011101010001110101011001110111010111000110100111101110001011110 eee48eb1eab2b4ea8edfd8e78eb78ebdecba8eaceeb8d3deeee48eb1eab2b4ea8edfd8e78eb78ebdecba8eaceeb8d3dc5e

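SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f), which is why the ISO-8859-1 row consists almost entirely of 3f bytes.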
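
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name `encode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Unencodable characters are replaced with "?", which matches the behavior visible in the table above.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return one (label, characters, binary, hex) tuple per charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Decode again to show what actually survived the round trip.
        shown = data.decode(codec, errors="replace")
        # Each byte becomes eight binary digits, zero-padded on the left.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("低大^"):
        print(label, shown, bits, hexstr)
```

For the input "低大^", the UTF-8 row ends in `e4bd8ee5a4a75e` and the SJIS-WIN row in `92e191e55e`, consistent with the corresponding byte sequences in the table above.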