To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 娃??旬???????????曜??苒λ?^ 100010001010000100111111001111111000111101111011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110010111011010100011111100111111111001001001001010000011110010010011111101011110 88a13f3f8f7b3f3f3f3f3f3f3f3f3f3f3f976a3f3fe49283c93f5e
EUC-JP 娃??旬???????????曜??苒λ?^ 101100001010001100111111001111111011110111011100001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111001101110010110011111100111111111001111111001010100110110010110011111101011110 b0a33f3fbddc3f3f3f3f3f3f3f3f3f3f3fcdcb3f3fe7f2a6cb3f5e
UTF-8 娃듬챶旬쇤슋溜묊젰溜잓겣紐껁꽱曜쏅젒苒λ젍^ 111001011010100010000011111010111001001110101100111011001011000110110110111001101001011110101100111011001000011110100100111011001000101010001011111011111010011110001011111010111010110010001010111011001010000010110000111011111010011110001011111011001001111010010011111010101011001010100011111011111010011110001111111010101011101110000001111010101011110110110001111001101001101110011100111011001000111110000101111011001010000010010010111010001000101110010010110011101011101111101100101000001000110101011110 e5a883eb93acecb1b6e697acec87a4ec8a8befa78bebac8aeca0b0efa78bec9e93eab2a3efa78feabb81eabdb1e69b9cec8f85eca092e88b92cebbeca08d5e
UHC 娃듬챶旬쇤슋溜묊젰溜잓겣紐껁꽱曜쏅젒苒λ젍^ 11101000110111111011010111101011101010101000001111100010111000101011110011101001100110101001101111101010111111101001000111100111101000001010010111101010111111101001111111101001100000011011010111101011101010101000001111100011100001001011110011101000111110001001101111101011101000001001000111100110111111101010010111101011101000001000111001011110 e8dfb5ebaa83e2e2bce99a9beafe91e7a0a5eafe9fe981b5ebaa83e384bce8f89beba091e6fea5eba08e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)