To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???宜?ぜ濡⑥?腰???????????諭? 0011111100111111001111111000101101011000001111111000001010111010100101000100011110000111010001010011111110001101100110000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100101110100000000111111 3f3f3f8b583f82ba944787453f8d983f3f3f3f3f3f3f3f3f3f3f97403f
EUC-JP ???宜?ぜ濡??腰??彛????????諭? 001111110011111100111111101101011011100100111111101001001011110011000111101010000011111100111111101110011111100000111111001111111000111110111100111110100011111100111111001111110011111100111111001111110011111100111111110011011010000100111111 3f3f3fb5b93fa4bcc7a83f3fb9f83f3f8fbcfa3f3f3f3f3f3f3f3fcda13f
UTF-8 嶺뚮뿫宜배ぜ濡⑥맊腰밴내彛볠뇻類ㅼ젌嶪용뜇諭숥 111011111010011010101011111010111001101010101110111010111011111110101011111001011010111010011100111010111011000010110000111000111000000110011100111001101011111110100001111000101001000110100101111010111010011110001010111010001000010110110000111010111011000010110100111010111000001010110100111001011011110110011011111010111011001110100000111010111000011110111011111011111010011110010000111000111000010110111100111011001010000010001100111001011011011010101010111011001001101010101001111010111001110010000111111010001010101110101101111011001000100010100101 efa6abeb9aaeebbfabe5ae9cebb0b0e3819ce6bfa1e291a5eba78ae885b0ebb0b4eb82b4e5bd9bebb3a0eb87bbefa790e385bceca08ce5b6aaec9aa9eb9c87e8abadec88a5
UHC 嶺뚮뿫宜배ぜ濡⑥맊腰밴내彛볠뇻類ㅼ젌嶪용뜇諭숥 11100111101011011000110011101011100101111010101111101011111100011011100111101000101010101011110011101011101000011010100011101100100100001010001011101001101001101011100111101010101100111011101111101100101011011001001111100110101101001010011111101011101110101010010011101100101000001000110111100101111101011011111111101011100011011000101011101011101100011001101001000010 e7ad8ceb97abebf1b9e8aabceba1a8ec90a2e9a6b9eab3bbecad93e6b4a7ebbaa4eca08de5f5bfeb8d8aebb19a42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
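
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the code behind this page: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the rows above.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for the sketch.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # U+305C is the hiragana letter ZE, which also appears in the table rows.
    for label, (binary, hexadecimal) in encode_table("\u305c").items():
        print(label, binary, hexadecimal)
```

For the hiragana letter U+305C this gives a4bc in EUC-JP, 82ba in SJIS-WIN, and e3819c in UTF-8, matching the byte sequences visible in the corresponding rows, while ISO-8859-1 falls back to 3f.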