What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???嶽??????????嶽???????^ 00111111001111110011111110011011110101000011111100111111001111110011111100111111001111110011111100111111001111110011111110011011110101000011111100111111001111110011111100111111001111110011111101011110 3f3f3f9bd43f3f3f3f3f3f3f3f3f3f9bd43f3f3f3f3f3f3f5e
EUC-JP ???嶽??????????嶽???????^ 00111111001111110011111111010110110101100011111100111111001111110011111100111111001111110011111100111111001111110011111111010110110101100011111100111111001111110011111100111111001111110011111101011110 3f3f3fd6d63f3f3f3f3f3f3f3f3f3fd6d63f3f3f3f3f3f3f5e
UTF-8 센셈센嶽셋센솬렯렱렯롃센셈센嶽셋센솩렯렱렯롏^ 11101100100001001011110011101100100001011000100011101100100001001011110011100101101101101011110111101100100001011000101111101100100001001011110011101100100001101010110011101011101000001010111111101011101000001011000111101011101000001010111111101011101000011000001111101100100001001011110011101100100001011000100011101100100001001011110011100101101101101011110111101100100001011000101111101100100001001011110011101100100001101010100111101011101000001010111111101011101000001011000111101011101000001010111111101011101000011000111101011110 ec84bcec8588ec84bce5b6bdec858bec84bcec86aceba0afeba0b1eba0afeba183ec84bcec8588ec84bce5b6bdec858bec84bcec86a9eba0afeba0b1eba0afeba18f5e
UHC 센셈센嶽셋센솬렯렱렯롃센셈센嶽셋센솩렯렱렯롏^ 101111001011111010111100110000001011110010111110111001001100000010111100110000101011110010111110101111001101111110001110101111001000111010111110100011101011110010001110110010101011110010111110101111001100000010111100101111101110010011000000101111001100001010111100101111101011110011011110100011101011110010001110101111101000111010111100100011101101010101011110 bcbebcc0bcbee4c0bcc2bcbebcdf8ebc8ebe8ebc8ecabcbebcc0bcbee4c0bcc2bcbebcde8ebc8ebe8ebc8ed55e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
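
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Build one (bits, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    sample = "嶽^"
    for label, (bits, hex_string) in encode_table(sample).items():
        print(label, sample, bits, hex_string)
```

Each byte is formatted as a zero-padded 8-bit group, so the binary column is always four times as long as the hexadecimal column.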