What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 障尞テヲハシトイ}障尞テヲハシトイ{^ 100011111110000111111010101010111100001110100110110010101011110011000100101100101111010010100101011111011000111111100001111110101010101111000011101001101100101010111100110001001011001011110100101001010111101101011110 8fe1faabc3a6cabcc4b2f4a57d8fe1faabc3a6cabcc4b2f4a57b5e
EUC-JP 障尞テヲハシトイ?}障尞テヲハシトイ?{^ 101111101110001110001111101110101110101110001110110000111000111010100110100011101100101010001110101111001000111011000100100011101011001000111111011111011011111011100011100011111011101011101011100011101100001110001110101001101000111011001010100011101011110010001110110001001000111010110010001111110111101101011110 bee38fbaeb8ec38ea68eca8ebc8ec48eb23f7dbee38fbaeb8ec38ea68eca8ebc8ec48eb23f7b5e
UTF-8 障尞テヲハシトイ}障尞テヲハシトイ{^ 111010011001101010011100111001011011000010011110111011111011111010000011111011111011110110100110111011111011111010001010111011111011110110111100111011111011111010000100111011111011110110110010111011101000110110010100011111011110100110011010100111001110010110110000100111101110111110111110100000111110111110111101101001101110111110111110100010101110111110111101101111001110111110111110100001001110111110111101101100101110111010001101100101000111101101011110 e99a9ce5b09eefbe83efbda6efbe8aefbdbcefbe84efbdb2ee8d947de99a9ce5b09eefbe83efbda6efbe8aefbdbcefbe84efbdb2ee8d947b5e
UHC 障????????}障????????{^ 1110111010100001001111110011111100111111001111110011111100111111001111110011111101111101111011101010000100111111001111110011111100111111001111110011111100111111001111110111101101011110 eea13f3f3f3f3f3f3f3f7deea13f3f3f3f3f3f3f3f7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
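
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("障}").items():
        print(label, bits, hexa)
```

Each byte is formatted as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.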