To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 陝セ莉呻スケ豕悟椏Lh陝セ莉呻スケ豕悟椏L 111010001001111110111110111001001011101110011001111011111011110110111001111001101011001110001100111001011001111010010011010011000110100011101000100111111011111011100100101110111001100111101111101111011011100111100110101100111000110011100101100111101001001101001100 e89fbee4bb99efbdb9e6b38ce59e934c68e89fbee4bb99efbdb9e6b38ce59e934c
EUC-JP 陝セ莉呻スケ豕悟椏Lh陝セ莉呻スケ豕悟椏L 111100001010000110001110101111101110100010111101110100101111000110001110101111011000111010111001111011001011010110111000111001111101101111110011010011000110100011110000101000011000111010111110111010001011110111010010111100011000111010111101100011101011100111101100101101011011100011100111110110111111001101001100 f0a18ebee8bdd2f18ebd8eb9ecb5b8e7dbf34c68f0a18ebee8bdd2f18ebd8eb9ecb5b8e7dbf34c
UTF-8 陝セ莉呻スケ豕悟椏Lh陝セ莉呻スケ豕悟椏L 111010011001100110011101111011111011110110111110111010001000111010001001111001011001000110111011111011111011110110111101111011111011110110111001111010001011000110010101111001101000001010011111111001101010010010001111010011000110100011101001100110011001110111101111101111011011111011101000100011101000100111100101100100011011101111101111101111011011110111101111101111011011100111101000101100011001010111100110100000101001111111100110101001001000111101001100 e9999defbdbee88e89e591bbefbdbdefbdb9e8b195e6829fe6a48f4c68e9999defbdbee88e89e591bbefbdbdefbdb9e8b195e6829fe6a48f4c
UHC 陝?莉呻??豕悟?Lh陝?莉呻??豕悟?L 11100000111011010011111111010111111010011110001111100010001111110011111111100011110011101110011111110110001111110100110001101000111000001110110100111111110101111110100111100011111000100011111100111111111000111100111011100111111101100011111101001100 e0ed3fd7e9e3e23f3fe3cee7f63f4c68e0ed3fd7e9e3e23f3fe3cee7f63f4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)