To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 第?蔚?儀皎缺虞當第?蔚?儀皎缺虞當B 100100011110011000111111100010010101010100111111100010110101011011100001101001111110001110011110100010111111000111100001011000111001000111100110001111111000100101010101001111111000101101010110111000011010011111100011100111101000101111110001111000010110001101000010 91e63f89553f8b56e1a7e39e8bf1e16391e63f89553f8b56e1a7e39e8bf1e16342
EUC-JP 第?蔚?儀皎缺虞當第?蔚?儀皎缺虞當B 110000101110100000111111101100011011011000111111101101011011011111100010101010011110010111111110101101101111001111100001110001001100001011101000001111111011000110110110001111111011010110110111111000101010100111100101111111101011011011110011111000011100010001000010 c2e83fb1b63fb5b7e2a9e5feb6f3e1c4c2e83fb1b63fb5b7e2a9e5feb6f3e1c442
UTF-8 第렰蔚렰儀皎缺虞當第렰蔚렰儀皎缺虞當B 11100111101011001010110011101011101000001011000011101000100101001001101011101011101000001011000011100101100001001000000011100111100110101000111011100111101111001011101011101000100110011001111011100111100101011011011011100111101011001010110011101011101000001011000011101000100101001001101011101011101000001011000011100101100001001000000011100111100110101000111011100111101111001011101011101000100110011001111011100111100101011011011001000010 e7acaceba0b0e8949aeba0b0e58480e79a8ee7bcbae8999ee795b6e7acaceba0b0e8949aeba0b0e58480e79a8ee7bcbae8999ee795b642
UHC 第렰蔚렰儀皎缺虞當第렰蔚렰儀皎缺虞當B 11110000101011111000111010111101111010101010010110001110101111011110101111110000110011101110101111001100110000001110100111100101110100111101011111110000101011111000111010111101111010101010010110001110101111011110101111110000110011101110101111001100110000001110100111100101110100111101011101000010 f0af8ebdeaa58ebdebf0ceebccc0e9e5d3d7f0af8ebdeaa58ebdebf0ceebccc0e9e5d3d742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)