What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 阿??油??攸?????肄よ?應??悟??? 1000100010100010001111110011111110010110111110110011111100111111100111011011111100111111001111110011111100111111001111111110001111100101100000101110011000111111100111001110010000111111001111111000110011100101001111110011111100111111 88a23f3f96fb3f3f9dbf3f3f3f3f3fe3e582e63f9ce43f3f8ce53f3f3f
EUC-JP 阿??油??攸?????肄よ?應??悟??? 1011000010100100001111110011111111001100111111010011111100111111110110101100000100111111001111110011111100111111001111111110011011100111101001001110100000111111110110001110011000111111001111111011100011100111001111110011111100111111 b0a43f3fccfd3f3fdac13f3f3f3f3fe6e7a4e83fd8e63f3fb8e73f3f3f
UTF-8 阿잆굝油뺝선攸귥뵫力녹떑肄よ뿥應쎄괴悟딅벩溜 111010011001100010111111111011001001111010000110111010101011010110011101111001101011001010111001111010111011101010011101111011001000010010100000111001101001010010111000111010101011011110100101111010111011010110101011111011111010011010001010111010111000010110111001111010111001011010010001111010001000001010000100111000111000001010001000111010111011111110100101111001101000011110001001111011001000111010000100111010101011010010110100111001101000001010011111111010111001010010000101111010111011001010101001111011111010011110001011 e998bfec9e86eab59de6b2b9ebba9dec84a0e694b8eab7a5ebb5abefa68aeb85b9eb9691e88284e38288ebbfa5e68789ec8e84eab4b4e6829feb9485ebb2a9efa78b
UHC 阿잆굝油뺝선攸귥뵫力녹떑肄よ뿥應쎄괴悟딅벩溜 1110010010111001100111111110001110000010100001011110101011111010100101011110010110111100101100011110101011110010100000101110110010010100101010011110011010110011101100111110110010001011101001111110110010111101101010101110100010010111101001011110101111101011101111011110101010110001101010111110011111110110100010101110101110010011101111111110101011111110 e4b99fe38285eafa95e5bcb1eaf282ec94a9e6b3b3ec8ba7ecbdaae897a5ebebbdeab1abe7f68aeb93bfeafe

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
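
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are Python's, and mapping them to the labels above is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("阿").items():
        print(label, bits, hex_string)
```

For the single character 阿, this sketch yields `e998bf` for UTF-8, `88a2` for SJIS-WIN, `b0a4` for EUC-JP, and `3f` for ISO-8859-1, which agrees with the leading bytes of the corresponding rows above.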