What bit string is a character (or a short run of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓??魏??遺??艶l?淫??碎??孃 111010001011110100111111001111111110100110110000001111110011111110001000111000100011111100111111100010011001000010000010100011000011111110001000111110100011111100111111111000011110101000111111001111111001101101101111 e8bd3f3fe9b03f3f88e23f3f8990828c3f88fa3f3fe1ea3f3f9b6f
EUC-JP 霓??魏??遺??艶l?淫??碎??孃 111100001011111100111111001111111111001010110010001111110011111110110000111001000011111100111111101100011111000010100011111011000011111110110000111111000011111100111111111000101110110000111111001111111101010111010000 f0bf3f3ff2b23f3fb0e43f3fb1f0a3ec3fb0fc3f3fe2ec3f3fd5d0
UTF-8 霓낅뜄魏뚦젆遺밟뀻艶l꼵淫먪쥗碎밸즵孃 111010011001110010010011111010111000001010000101111010111001110010000100111010011010110110001111111010111001101010100110111011001010000010000110111010011000000110111010111010111011000010011111111010111000000010111011111010001000100110110110111011111011110110001100111010101011110010110101111001101011011110101011111010111010100010101010111011001010010110010111111001111010001010001110111010111011000010111000111011001010011010110101111001011010110110000011 e99c93eb8285eb9c84e9ad8feb9aa6eca086e981baebb09feb80bbe889b6efbd8ceabcb5e6b7abeba8aaeca597e7a28eebb0b8eca6b5e5ad83
UHC 霓낅뜄魏뚦젆遺밟뀻艶l꼵淫먪쥗碎밸즵孃 1110011111100111100001011110101110001101100010001110101011100000100011001110010110100000100010011110101110110110101110011110001010000101101100011110011011111101101000111110110010000100100011011110101111100010100100001110011110100010100011011110000111101111101110011110101110100011100001111110010110111110 e7e785eb8d88eae08ce5a089ebb6b9e285b1e6fda3ec848debe290e7a28de1efb9eba387e5be

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
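
A conversion like the one in the table above can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the helper names encode_row and encode_table are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, bits, hexstr)
```

Using errors="replace" is a design choice made here so that every charset yields a row even when it cannot represent the input; with the default strict mode, Python would raise UnicodeEncodeError instead.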