Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 障蕎????♂制??障蕎????♂制??B 1000111111100001100010111011110000111111001111110011111100111111100000011000100110010000101001110011111100111111100011111110000110001011101111000011111100111111001111110011111110000001100010011001000010100111001111110011111101000010 8fe18bbc3f3f3f3f818990a73f3f8fe18bbc3f3f3f3f818990a73f3f42
EUC-JP 障蕎????♂制??障蕎????♂制??B 1011111011100011101101101011111000111111001111110011111100111111101000011110100111000000101010010011111100111111101111101110001110110110101111100011111100111111001111110011111110100001111010011100000010101001001111110011111101000010 bee3b6be3f3f3f3fa1e9c0a93f3fbee3b6be3f3f3f3fa1e9c0a93f3f42
UTF-8 障蕎틔뤱횓등♂制₃렓障蕎틔뤱횓등♂制₃렓B 11101001100110101001110011101000100101011000111011101101100010111001010011101011101001001011000111101101100110101001001111101011100100111011000111100010100110011000001011100101100010001011011011100010100000101000001111101011101000001001001111101001100110101001110011101000100101011000111011101101100010111001010011101011101001001011000111101101100110101001001111101011100100111011000111100010100110011000001011100101100010001011011011100010100000101000001111101011101000001001001101000010 e99a9ce8958eed8b94eba4b1ed9a93eb93b1e29982e588b6e28283eba093e99a9ce8958eed8b94eba4b1ed9a93eb93b1e29982e588b6e28283eba09342
UHC 障蕎틔뤱횓등♂制₃렓障蕎틔뤱횓등♂制₃렓B 1110111010100001110011101111000011000110101101111000111111011111110000111000111010110101111011101010000111001110111100001010010010101001111111011000111010101000111011101010000111001110111100001100011010110111100011111101111111000011100011101011010111101110101000011100111011110000101001001010100111111101100011101010100001000010 eea1cef0c6b78fdfc38eb5eea1cef0a4a9fd8ea8eea1cef0c6b78fdfc38eb5eea1cef0a4a9fd8ea842

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
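
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents, and `errors="replace"` is an assumption chosen because it emits `?` (hex `3f`) for characters a charset cannot represent, similar to the `3f` bytes in the table. The helper name `to_bitstrings` is hypothetical.

```python
# Map the charset labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # Characters the codec cannot represent become '?' (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "障♂B"  # short sample input; any string works
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bitstrings(sample, codec)
        print(label, binary, hexadecimal)
```

Exact bytes for some characters may differ from the table, since vendor mappings for the same charset label are not always identical across implementations.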