What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 症贇セレセラマ}症贇セレセラマ{^ 1000111111000111111001101101010011110001100011101011111011011010111100011000111010111110110101111100111101111101100011111100011111100110110101001111000110001110101111101101101011110001100011101011111011010111110011110111101101011110 8fc7e6d4f18ebedaf18ebed7cf7d8fc7e6d4f18ebedaf18ebed7cf7b5e
EUC-JP 症贇?セレ?セラマ}症贇?セレ?セラマ{^ 1011111011001001111011001101011000111111100011101011111010001110110110100011111110001110101111101000111011010111100011101100111101111101101111101100100111101100110101100011111110001110101111101000111011011010001111111000111010111110100011101101011110001110110011110111101101011110 bec9ecd63f8ebe8eda3f8ebe8ed78ecf7dbec9ecd63f8ebe8eda3f8ebe8ed78ecf7b5e
UTF-8 症贇セレセラマ}症贇セレセラマ{^ 111001111001011110000111111010001011010010000111111011101000010010001001111011111011110110111110111011111011111010011010111011101000010010001001111011111011110110111110111011111011111010010111111011111011111010001111011111011110011110010111100001111110100010110100100001111110111010000100100010011110111110111101101111101110111110111110100110101110111010000100100010011110111110111101101111101110111110111110100101111110111110111110100011110111101101011110 e79787e8b487ee8489efbdbeefbe9aee8489efbdbeefbe97efbe8f7de79787e8b487ee8489efbdbeefbe9aee8489efbdbeefbe97efbe8f7b5e
UHC 症贇???????}症贇???????{^ 11110001111110001110101111001011001111110011111100111111001111110011111100111111001111110111110111110001111110001110101111001011001111110011111100111111001111110011111100111111001111110111101101011110 f1f8ebcb3f3f3f3f3f3f3f7df1f8ebcb3f3f3f3f3f3f3f7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win, i.e. CP932) and UNIX (EUC-JP).
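
The kind of conversion shown in the table can be approximated with a minimal Python sketch. It is not this page's implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are Python's own and are assumed to correspond to the labels above; the function name `encode_all` is hypothetical. Unencodable characters are replaced with `?` (0x3f), which matches the `3f` bytes in the table, though individual mappings may differ slightly between implementations.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary bit string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("症").items():
        print(label, bits, hexstr)
```

For the single character 症, this sketch yields `e79787` in UTF-8, `8fc7` in SJIS-WIN, and `bec9` in EUC-JP, the same leading bytes as the corresponding rows in the table.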