What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昭セ叱式尺軸シオ昭ロシエ晶ウ鴫竺射質 1000111110111010101111101000111010110110111100011110111010001110101011101000111011011010111100011110111010001110101100101011110010110101100011111011101011011011101111001011010010001111101110111011001110001110101100001111000111101110100011101011000110001110110010111000111010111111 8fbabe8eb6f1ee8eae8edaf1ee8eb2bcb58fbadbbcb48fbbb38eb0f1ee8eb18ecb8ebf
EUC-JP 昭セ叱?式尺?軸シオ昭ロシエ晶ウ鴫?竺射質 101111101011110010001110101111101011110010111000001111111011110010110000101111001101110000111111101111001011010010001110101111001000111010110101101111101011110010001110110110111000111010111100100011101011010010111110101111011000111010110011101111001011001000111111101111001011001110111100110011011011110011000001 bebc8ebebcb83fbcb0bcdc3fbcb48ebc8eb5bebc8edb8ebc8eb4bebd8eb3bcb23fbcb3bccdbcc1
UTF-8 昭セ叱式尺軸シオ昭ロシエ晶ウ鴫竺射質 111001101001100010101101111011111011110110111110111001011000111110110001111011101000010110101001111001011011110010001111111001011011000010111010111011101000010110101001111010001011101110111000111011111011110110111100111011111011110110110101111001101001100010101101111011111011111010011011111011111011110110111100111011111011110110110100111001101001100110110110111011111011110110110011111010011011010010101011111011101000010110101001111001111010101110111010111001011011000010000100111010001011001110101010 e698adefbdbee58fb1ee85a9e5bc8fe5b0baee85a9e8bbb8efbdbcefbdb5e698adefbe9befbdbcefbdb4e699b6efbdb3e9b4abee85a9e7abbae5b084e8b3aa
UHC 昭?叱?式尺?軸??昭???晶???竺射質 11100001101110010011111111110010111010100011111111100011110100101111010010101001001111111111010111101110001111110011111111100001101110010011111100111111001111111110111111011100001111110011111100111111111101011110011111011110110100101111001011110101 e1b93ff2ea3fe3d2f4a93ff5ee3f3fe1b93f3f3fefdc3f3f3ff5e7ded2f2f5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
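
The conversion shown in the table can be approximated with a minimal Python sketch. This is not the tool's own implementation, which is not shown here. The function name `encode_table` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which is consistent with the 3f bytes in the table, although Python's codecs may not agree with the tool on every character.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Eight bits per byte, zero-padded, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

For a single ASCII character such as "A", every charset listed yields the same byte (41 in hexadecimal). The rows differ only for characters outside ASCII.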