To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 甑???姐?義?砥?甑???姐?義?砥?^ 1000110110011001001111110011111100111111100010001011011100111111100010110110000000111111100100110111010100111111100011011001100100111111001111110011111110001000101101110011111110001011011000000011111110010011011101010011111101011110 8d993f3f3f88b73f8b603f93753f8d993f3f3f88b73f8b603f93753f5e
EUC-JP 甑???姐?義?砥?甑???姐?義?砥?^ 1011100111111001001111110011111100111111101100001011100100111111101101011100000100111111110001011101011000111111101110011111100100111111001111110011111110110000101110010011111110110101110000010011111111000101110101100011111101011110 b9f93f3f3fb0b93fb5c13fc5d63fb9f93f3f3fb0b93fb5c13fc5d63f5e
UTF-8 甑비렰렏姐렰義렒砥받甑비렰렏姐렰義렒砥밗^ 11100111100101001001000111101011101110011000010011101011101000001011000011101011101000001000111111100101101001111001000011101011101000001011000011100111101111101010100111101011101000001001001011100111101000001010010111101011101100001001101111100111100101001001000111101011101110011000010011101011101000001011000011101011101000001000111111100101101001111001000011101011101000001011000011100111101111101010100111101011101000001001001011100111101000001010010111101011101100001001011101011110 e79491ebb984eba0b0eba08fe5a790eba0b0e7bea9eba092e7a0a5ebb09be79491ebb984eba0b0eba08fe5a790eba0b0e7bea9eba092e7a0a5ebb0975e
UHC 甑비렰렏姐렰義렒砥받甑비렰렏姐렰義렒砥밗^ 1111000111110111101110101111000110001110101111011000111010100101111011101011101110001110101111011110101111111001100011101010011111110010101100101011100111011110111100011111011110111010111100011000111010111101100011101010010111101110101110111000111010111101111010111111100110001110101001111111001010110010101110011101110001011110 f1f7baf18ebd8ea5eebb8ebdebf98ea7f2b2b9def1f7baf18ebd8ea5eebb8ebdebf98ea7f2b2b9dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)