What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????而?祭攀???????而?祭攀???B 0011111100111111001111110011111110001110101001110011111110001101110101011001110110110011001111110011111100111111001111110011111100111111001111111000111010100111001111111000110111010101100111011011001100111111001111110011111101000010 3f3f3f3f8ea73f8dd59db33f3f3f3f3f3f3f8ea73f8dd59db33f3f3f42
EUC-JP ????而?祭攀???????而?祭攀???B 0011111100111111001111110011111110111100101010010011111110111010110101111101101010110101001111110011111100111111001111110011111100111111001111111011110010101001001111111011101011010111110110101011010100111111001111110011111101000010 3f3f3f3fbca93fbad7dab53f3f3f3f3f3f3fbca93fbad7dab53f3f3f42
UTF-8 炡쇨렮렪而렲祭攀렖꿰쓱炡쇨렮렪而렲祭攀렖꿰쓱B 11100111100000101010000111101100100001111010100011101011101000001010111011101011101000001010101011101000100000001000110011101011101000001011001011100111101001011010110111100110100101001000000011101011101000001001011011101010101111111011000011101100100100111011000111100111100000101010000111101100100001111010100011101011101000001010111011101011101000001010101011101000100000001000110011101011101000001011001011100111101001011010110111100110100101001000000011101011101000001001011011101010101111111011000011101100100100111011000101000010 e782a1ec87a8eba0aeeba0aae8808ceba0b2e7a5ade69480eba096eabfb0ec93b1e782a1ec87a8eba0aeeba0aae8808ceba0b2e7a5ade69480eba096eabfb0ec93b142
UHC 炡쇨렮렪而렲祭攀렖꿰쓱炡쇨렮렪而렲祭攀렖꿰쓱B 111011111110100010111100111010101000111010111011100011101011100011101100101110111000111010111111111100001010111011011010111001111000111010101011101100101110011110111110101100111110111111101000101111001110101010001110101110111000111010111000111011001011101110001110101111111111000010101110110110101110011110001110101010111011001011100111101111101011001101000010 efe8bcea8ebb8eb8ecbb8ebff0aedae78eabb2e7beb3efe8bcea8ebb8eb8ecbb8ebff0aedae78eabb2e7beb342
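Rows like the ones above can be approximated with a short Python sketch. It assumes that Python's `cp932` codec stands in for SJIS-WIN and `cp949` for UHC, and that unencodable characters are replaced with "?" (0x3f), which matches the runs of 3f in the table. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical helpers for illustration, not part of the tool itself, and edge-case mappings may differ from the converter's own tables.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Only the ASCII bit and hex strings are printed, so this is safe on any console.
    for label, (bits, hexed) in encode_table("而B").items():
        print(label, bits, hexed)
```

For example, the character 而 encodes to three bytes in UTF-8 but two bytes in both SJIS-WIN and EUC-JP, while ISO-8859-1 cannot represent it at all and falls back to a single 0x3f byte.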

SJIS-Win, EUC-JP: Classic character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).