To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???踰??淫????????諛????〕弛 001111110011111100111111111001101111101000111111001111111000100011111010001111110011111100111111001111110011111100111111001111110011111111100110100001110011111100111111001111110011111110000001011011001001001001101111 3f3f3fe6fa3f3f88fa3f3f3f3f3f3f3f3fe6873f3f3f3f816c926f
EUC-JP ???踰??淫?????彛??諛??獒?〕弛 00111111001111110011111111101100111111000011111100111111101100001111110000111111001111110011111100111111001111111000111110111100111110100011111100111111111010111110011100111111001111111000111111001011101110110011111110100001110011011100001111010000 3f3f3fecfc3f3fb0fc3f3f3f3f3f8fbcfa3f3febe73f3f8fcbbb3fa1cdc3d0
UTF-8 閱묐갭踰딉쭏淫뉖뎠亮쎈굞彛뗥넇諛대옪獒뺣〕弛 111010011001011010110001111010111010110010010000111010101011000010101101111010001011100010110000111010111001010010001001111011001010110110001111111001101011011110101011111010111000100110010110111010111000111010100000111011111010010110110111111011001000111010001000111010101011010110011110111001011011110110011011111010111001011110100101111010111000010010000111111010001010101110011011111010111000110010000000111011001001100010101010111001111000110110010010111010111011101010100011111000111000000010010101111001011011110010011011 e996b1ebac90eab0ade8b8b0eb9489ecad8fe6b7abeb8996eb8ea0efa5b7ec8e88eab59ee5bd9beb97a5eb8487e8ab9beb8c80ec98aae78d92ebbaa3e38095e5bc9b
UHC 閱묐갭踰딉쭏淫뉖뎠亮쎈굞彛뗥넇諛대옪獒뺣〕弛 1110011011110011100100011110101110110000101110001110101110110010100010101110111110100111100010001110101111100010100001111110101110110101101100011110010110111001101111011110101110000010100001101110110010101101100010111110010110000110100101111110101110110000101101001110101110011110101010011110100010100011100101011110101110100001101100111110110010101100 e6f391ebb0b8ebb28aefa788ebe287ebb5b1e5b9bdeb8286ecad8be58697ebb0b4eb9ea9e8a395eba1b3ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)