To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 第?蔚?愉皎缺猥當第?蔚?愉皎缺猥當B 100100011110011000111111100010010101010100111111100101101111100111100001101001111110001110011110111000001100111011100001011000111001000111100110001111111000100101010101001111111001011011111001111000011010011111100011100111101110000011001110111000010110001101000010 91e63f89553f96f9e1a7e39ee0cee16391e63f89553f96f9e1a7e39ee0cee16342
EUC-JP 第?蔚?愉皎缺猥當第?蔚?愉皎缺猥當B 110000101110100000111111101100011011011000111111110011001111101111100010101010011110010111111110111000001101000011100001110001001100001011101000001111111011000110110110001111111100110011111011111000101010100111100101111111101110000011010000111000011100010001000010 c2e83fb1b63fccfbe2a9e5fee0d0e1c4c2e83fb1b63fccfbe2a9e5fee0d0e1c442
UTF-8 第렰蔚렰愉皎缺猥當第렰蔚렰愉皎缺猥當B 11100111101011001010110011101011101000001011000011101000100101001001101011101011101000001011000011100110100001001000100111100111100110101000111011100111101111001011101011100111100011001010010111100111100101011011011011100111101011001010110011101011101000001011000011101000100101001001101011101011101000001011000011100110100001001000100111100111100110101000111011100111101111001011101011100111100011001010010111100111100101011011011001000010 e7acaceba0b0e8949aeba0b0e68489e79a8ee7bcbae78ca5e795b6e7acaceba0b0e8949aeba0b0e68489e79a8ee7bcbae78ca5e795b642
UHC 第렰蔚렰愉皎缺猥當第렰蔚렰愉皎缺猥當B 11110000101011111000111010111101111010101010010110001110101111011110101011110000110011101110101111001100110000001110100011100101110100111101011111110000101011111000111010111101111010101010010110001110101111011110101011110000110011101110101111001100110000001110100011100101110100111101011101000010 f0af8ebdeaa58ebdeaf0ceebccc0e8e5d3d7f0af8ebdeaa58ebdeaf0ceebccc0e8e5d3d742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)