To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 陝セ闊鯉ス剰嵯豕扶v陝セ闊鯉ス剰嵯豕扶vB 1110100010011111101111101110100010001000100011001110111110111101100011111110100010001101101101011110011010110011100101010111110101110110111010001001111110111110111010001000100010001100111011111011110110001111111010001000110110110101111001101011001110010101011111010111011001000010 e89fbee8888cefbd8fe88db5e6b3957d76e89fbee8888cefbd8fe88db5e6b3957d7642
EUC-JP 陝セ闊鯉ス剰嵯豕扶v陝セ闊鯉ス剰嵯豕扶vB 111100001010000110001110101111101110111111101000101110001111000110001110101111011011111011101010101110101011011111101100101101011100100111011110011101101111000010100001100011101011111011101111111010001011100011110001100011101011110110111110111010101011101010110111111011001011010111001001110111100111011001000010 f0a18ebeefe8b8f18ebdbeeabab7ecb5c9de76f0a18ebeefe8b8f18ebdbeeabab7ecb5c9de7642
UTF-8 陝セ闊鯉ス剰嵯豕扶v陝セ闊鯉ス剰嵯豕扶vB 111010011001100110011101111011111011110110111110111010011001011110001010111010011010111110001001111011111011110110111101111001011000100110110000111001011011010110101111111010001011000110010101111001101000100110110110011101101110100110011001100111011110111110111101101111101110100110010111100010101110100110101111100010011110111110111101101111011110010110001001101100001110010110110101101011111110100010110001100101011110011010001001101101100111011001000010 e9999defbdbee9978ae9af89efbdbde589b0e5b5afe8b195e689b676e9999defbdbee9978ae9af89efbdbde589b0e5b5afe8b195e689b67642
UHC 陝?闊鯉??嵯豕扶v陝?闊鯉??嵯豕扶vB 111000001110110100111111111111001100010011010111111011110011111100111111111100111010101111100011110011101101110110100110011101101110000011101101001111111111110011000100110101111110111100111111001111111111001110101011111000111100111011011101101001100111011001000010 e0ed3ffcc4d7ef3f3ff3abe3cedda676e0ed3ffcc4d7ef3f3ff3abe3cedda67642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)