To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 汚??節??裔?汚??節??裔?B 1000100110011000001111110011111110010000110111110011111100111111111001011110000100111111100010011001100000111111001111111001000011011111001111110011111111100101111000010011111101000010 89983f3f90df3f3fe5e13f89983f3f90df3f3fe5e13f42
EUC-JP 汚??節??裔?汚??節??裔?B 1011000111111000001111110011111111000000111000010011111100111111111010101110001100111111101100011111100000111111001111111100000011100001001111110011111111101010111000110011111101000010 b1f83f3fc0e13f3feae33fb1f83f3fc0e13f3feae33f42
UTF-8 汚얕닃節뤺뜷裔챜汚얕닃節뤺뜷裔챜B 11100110101100011001101011101100100101101001010111101011100010111000001111100111101011111000000011101011101001001011101011101011100111001011011111101000101000111001010011101100101100011001110011100110101100011001101011101100100101101001010111101011100010111000001111100111101011111000000011101011101001001011101011101011100111001011011111101000101000111001010011101100101100011001110001000010 e6b19aec9695eb8b83e7af80eba4baeb9cb7e8a394ecb19ce6b19aec9695eb8b83e7af80eba4baeb9cb7e8a394ecb19c42
UHC 汚얕닃節뤺뜷裔챜汚얕닃節뤺뜷裔챜B 111001111111110110111110111010001000100010001100111011111011110110001111111010001000110110110101111001111110000010101010011010011110011111111101101111101110100010001000100011001110111110111101100011111110100010001101101101011110011111100000101010100110100101000010 e7fdbee8888cefbd8fe88db5e7e0aa69e7fdbee8888cefbd8fe88db5e7e0aa6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)