To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 淙?正?淙?隅?n}淙?正?淙?隅?n{^ 1001111111001000001111111001000010110011001111111001111111001000001111111000101111110111001111110110111001111101100111111100100000111111100100001011001100111111100111111100100000111111100010111111011100111111011011100111101101011110 9fc83f90b33f9fc83f8bf73f6e7d9fc83f90b33f9fc83f8bf73f6e7b5e
EUC-JP 淙?正?淙?隅?n}淙?正?淙?隅?n{^ 1101111011001010001111111100000010110101001111111101111011001010001111111011011011111001001111110110111001111101110111101100101000111111110000001011010100111111110111101100101000111111101101101111100100111111011011100111101101011110 deca3fc0b53fdeca3fb6f93f6e7ddeca3fc0b53fdeca3fb6f93f6e7b5e
UTF-8 淙렊正렱淙렊隅렣n}淙렊正렱淙렊隅렣n{^ 1110011010110111100110011110101110100000100010101110011010101101101000111110101110100000101100011110011010110111100110011110101110100000100010101110100110011010100001011110101110100000101000110110111001111101111001101011011110011001111010111010000010001010111001101010110110100011111010111010000010110001111001101011011110011001111010111010000010001010111010011001101010000101111010111010000010100011011011100111101101011110 e6b799eba08ae6ada3eba0b1e6b799eba08ae99a85eba0a36e7de6b799eba08ae6ada3eba0b1e6b799eba08ae99a85eba0a36e7b5e
UHC 淙렊正렱淙렊隅렣n}淙렊正렱淙렊隅렣n{^ 11110000111110001000111010100001111011111110000110001110101111101111000011111000100011101010000111101001111010101000111010110100011011100111110111110000111110001000111010100001111011111110000110001110101111101111000011111000100011101010000111101001111010101000111010110100011011100111101101011110 f0f88ea1efe18ebef0f88ea1e9ea8eb46e7df0f88ea1efe18ebef0f88ea1e9ea8eb46e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)