To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????×? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101011100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd73f
SJIS-WIN 壤??唯??議??壤??幼??議??壤?×? 100110101101111100111111001111111001011101000010001111110011111110001011011000110011111100111111100110101101111100111111001111111001011101100011001111110011111110001011011000110011111100111111100110101101111100111111100000010111111000111111 9adf3f3f97423f3f8b633f3f9adf3f3f97633f3f8b633f3f9adf3f817e3f
EUC-JP 壤??唯??議??壤??幼??議??壤?×瑗 1101010011100001001111110011111111001101101000110011111100111111101101011100010000111111001111111101010011100001001111110011111111001101110001000011111100111111101101011100010000111111001111111101010011100001001111111010000111011111100011111100110011000000 d4e13f3fcda33f3fb5c43f3fd4e13f3fcdc43f3fb5c43f3fd4e13fa1df8fccc0
UTF-8 壤깆쥋唯롥쉽議욧펶壤깆쥉幼싧쉽議우퐭壤깆×瑗 1110010110100011101001001110101010111001100001101110110010100101100010111110010110010100101011111110101110100001101001011110110010001001101111011110100010101101101100001110110010011010101001111110110110001110101101101110010110100011101001001110101010111001100001101110110010100101100010011110010110111001101111001110110010001011101001111110110010001001101111011110100010101101101100001110110010011010101100001110110110010000101011011110010110100011101001001110101010111001100001101100001110010111111001111001000110010111 e5a3a4eab986eca58be594afeba1a5ec89bde8adb0ec9aa7ed8eb6e5a3a4eab986eca589e5b9bcec8ba7ec89bde8adb0ec9ab0ed90ade5a3a4eab986c397e79197
UHC 壤깆쥋唯롥쉽議욧펶壤깆쥉幼싧쉽議우퐭壤깆×瑗 1110010110111101101100011110110010100010100001001110101011100110100011101110010110111101101100011110110010100001101111111110101010111100100001111110010110111101101100011110110010100010100000101110101011101010100110101110010110111101101100011110110010100001101111111110110010111101100101101110010110111101101100011110110010100001101111111110101010111100 e5bdb1eca284eae68ee5bdb1eca1bfeabc87e5bdb1eca282eaea9ae5bdb1eca1bfecbd96e5bdb1eca1bfeabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)