What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鞨ょ、顔エォ驕匁ョ蛾b螟顔エォ鞜匁流^ 11101000111000001000001011100101101001001000101011100111101101001010101111101001100000011001011011100110101011101000100111101001100000101000001011100101101001001000101011100111101101001010101111101000110111111001011011100110100101111010110001011110 e8e082e5a48ae7b4abe98196e6ae89e98282e5a48ae7b4abe8df96e697ac5e
EUC-JP 鞨ょ、顔エォ驕匁ョ蛾b螟顔エォ鞜匁流^ 11110000111000101010010011100111100011101010010010110100111010011000111010110100100011101010101111110001111000011100110011101000100011101010111010110010111010111010001111100010111010101010011010110100111010011000111010110100100011101010101111110000111000011100110011101000110011101010111001011110 f0e2a4e78ea4b4e98eb48eabf1e1cce88eaeb2eba3e2eaa6b4e98eb48eabf0e1cce8ceae5e
UTF-8 鞨ょ、顔エォ驕匁ョ蛾b螟顔エォ鞜匁流^ 11101001100111101010100011100011100000101000011111101111101111011010010011101001101000011001010011101111101111011011010011101111101111011010101111101001101010011001010111100101100011001000000111101111101111011010111011101000100110111011111011101111101111011000001011101000100111101001111111101001101000011001010011101111101111011011010011101111101111011010101111101001100111101001110011100101100011001000000111100110101101011000000101011110 e99ea8e38287efbda4e9a194efbdb4efbdabe9a995e58c81efbdaee89bbeefbd82e89e9fe9a194efbdb4efbdabe99e9ce58c81e6b5815e
UHC 鞨ょ?顔??驕??蛾b螟顔????流^ 11001010111010101010101011100111001111111110010011010100001111110011111111001110111101100011111100111111111001001011011010100011111000101101100110101101111001001101010000111111001111110011111100111111110101111011010101011110 caeaaae73fe4d43f3fcef63f3fe4b6a3e2d9ade4d43f3f3f3fd7b55e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
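
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions, and Python's codecs may differ from this tool's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("^"):
        print(label, bits, hex_string)  # every row ends in: 01011110 5e
```

The `^` character is plain ASCII, so it encodes to the single byte 0x5e in all five charsets, which is why every row in the table ends in `5e`.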