To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セュ爾セュ式セュ痔セォ蔀セュ爾セュ式セュ痔セォ蔀^ 101111101010110110001110101000101011111010101101100011101010111010111110101011011000111010100100101111101010101110001110110000011011111010101101100011101010001010111110101011011000111010101110101111101010110110001110101001001011111010101011100011101100000101011110 bead8ea2bead8eaebead8ea4beab8ec1bead8ea2bead8eaebead8ea4beab8ec15e
EUC-JP セュ爾セュ式セュ痔セォ蔀セュ爾セュ式セュ痔セォ蔀^ 10001110101111101000111010101101101111001010010010001110101111101000111010101101101111001011000010001110101111101000111010101101101111001010011010001110101111101000111010101011101111001100001110001110101111101000111010101101101111001010010010001110101111101000111010101101101111001011000010001110101111101000111010101101101111001010011010001110101111101000111010101011101111001100001101011110 8ebe8eadbca48ebe8eadbcb08ebe8eadbca68ebe8eabbcc38ebe8eadbca48ebe8eadbcb08ebe8eadbca68ebe8eabbcc35e
UTF-8 セュ爾セュ式セュ痔セォ蔀セュ爾セュ式セュ痔セォ蔀^ 11101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010110111100101101111001000111111101111101111011011111011101111101111011010110111100111100101111001010011101111101111011011111011101111101111011010101111101000100101001000000011101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010110111100101101111001000111111101111101111011011111011101111101111011010110111100111100101111001010011101111101111011011111011101111101111011010101111101000100101001000000001011110 efbdbeefbdade788beefbdbeefbdade5bc8fefbdbeefbdade79794efbdbeefbdabe89480efbdbeefbdade788beefbdbeefbdade5bc8fefbdbeefbdade79794efbdbeefbdabe894805e
UHC ??爾??式??痔?????爾??式??痔???^ 00111111001111111110110010110011001111110011111111100011110100100011111100111111111101101100000000111111001111110011111100111111001111111110110010110011001111110011111111100011110100100011111100111111111101101100000000111111001111110011111101011110 3f3fecb33f3fe3d23f3ff6c03f3f3f3f3fecb33f3fe3d23f3ff6c03f3f3f5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
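
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper name `encode_table` are assumptions chosen for illustration, and Python's mapping tables may differ from the tool's in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 and UHC rows above.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_table(text):
    """Return (charset, decoded text, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), binary, raw.hex()))
    return rows


if __name__ == "__main__":
    # U+FF7E is the halfwidth katakana "se", followed by an ASCII caret.
    for label, shown, binary, hexa in encode_table("\uff7e^"):
        print(label, shown, binary, hexa)
```

Decoding the encoded bytes again, rather than echoing the input, shows what actually survives the round trip in each charset.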