To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????k}????????k{^ 001111110011111100111111001111110011111100111111001111110011111101101011011111010011111100111111001111110011111100111111001111110011111100111111011010110111101101011110 3f3f3f3f3f3f3f3f6b7d3f3f3f3f3f3f3f3f6b7b5e
SJIS-WIN 鮓ッ逞巴蠍ク逍セk}鮓ッ逞巴蠍ク逍セk{^ 11101001101101101010111111100111100101111001010001100010111001011011011010111000111001111001011010111110011010110111110111101001101101101010111111100111100101111001010001100010111001011011011010111000111001111001011010111110011010110111101101011110 e9b6afe7979462e5b6b8e796be6b7de9b6afe7979462e5b6b8e796be6b7b5e
EUC-JP 鮓ッ逞巴蠍ク逍セk}鮓ッ逞巴蠍ク逍セk{^ 11110010101110001000111010101111111011011111011111000111110000111110101010111000100011101011100011101101111101101000111010111110011010110111110111110010101110001000111010101111111011011111011111000111110000111110101010111000100011101011100011101101111101101000111010111110011010110111101101011110 f2b88eafedf7c7c3eab88eb8edf68ebe6b7df2b88eafedf7c7c3eab88eb8edf68ebe6b7b5e
UTF-8 鮓ッ逞巴蠍ク逍セk}鮓ッ逞巴蠍ク逍セk{^ 1110100110101110100100111110111110111101101011111110100110000000100111101110010110110111101101001110100010100000100011011110111110111101101110001110100110000000100011011110111110111101101111100110101101111101111010011010111010010011111011111011110110101111111010011000000010011110111001011011011110110100111010001010000010001101111011111011110110111000111010011000000010001101111011111011110110111110011010110111101101011110 e9ae93efbdafe9809ee5b7b4e8a08defbdb8e9808defbdbe6b7de9ae93efbdafe9809ee5b7b4e8a08defbdb8e9808defbdbe6b7b5e
UHC ??逞巴??逍?k}??逞巴??逍?k{^ 001111110011111111010110110000011111011111101001001111110011111111100001110011100011111101101011011111010011111100111111110101101100000111110111111010010011111100111111111000011100111000111111011010110111101101011110 3f3fd6c1f7e93f3fe1ce3f6b7d3f3fd6c1f7e93f3fe1ce3f6b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)