To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 症?正?症?隅?n}症?正?症?隅?n{^ 1000111111000111001111111001000010110011001111111000111111000111001111111000101111110111001111110110111001111101100011111100011100111111100100001011001100111111100011111100011100111111100010111111011100111111011011100111101101011110 8fc73f90b33f8fc73f8bf73f6e7d8fc73f90b33f8fc73f8bf73f6e7b5e
EUC-JP 症?正?症?隅?n}症?正?症?隅?n{^ 1011111011001001001111111100000010110101001111111011111011001001001111111011011011111001001111110110111001111101101111101100100100111111110000001011010100111111101111101100100100111111101101101111100100111111011011100111101101011110 bec93fc0b53fbec93fb6f93f6e7dbec93fc0b53fbec93fb6f93f6e7b5e
UTF-8 症렊正렱症렊隅렣n}症렊正렱症렊隅렣n{^ 1110011110010111100001111110101110100000100010101110011010101101101000111110101110100000101100011110011110010111100001111110101110100000100010101110100110011010100001011110101110100000101000110110111001111101111001111001011110000111111010111010000010001010111001101010110110100011111010111010000010110001111001111001011110000111111010111010000010001010111010011001101010000101111010111010000010100011011011100111101101011110 e79787eba08ae6ada3eba0b1e79787eba08ae99a85eba0a36e7de79787eba08ae6ada3eba0b1e79787eba08ae99a85eba0a36e7b5e
UHC 症렊正렱症렊隅렣n}症렊正렱症렊隅렣n{^ 11110001111110001000111010100001111011111110000110001110101111101111000111111000100011101010000111101001111010101000111010110100011011100111110111110001111110001000111010100001111011111110000110001110101111101111000111111000100011101010000111101001111010101000111010110100011011100111101101011110 f1f88ea1efe18ebef1f88ea1e9ea8eb46e7df1f88ea1efe18ebef1f88ea1e9ea8eb46e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)