To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ハス治鴫湿}ハス治鴫湿{^ 1111000111100100110010101111000110001110101111011000111010100001111100001011010110001110101100001111000010110100100011101011110001111101111100011110010011001010111100011000111010111101100011101010000111110000101101011000111010110000111100001011010010001110101111000111101101011110 f1e4caf18ebd8ea1f0b58eb0f0b48ebc7df1e4caf18ebd8ea1f0b58eb0f0b48ebc7b5e
EUC-JP ?ハ?ス治?鴫?湿}?ハ?ス治?鴫?湿{^ 00111111100011101100101000111111100011101011110110111100101000110011111110111100101100100011111110111100101111100111110100111111100011101100101000111111100011101011110110111100101000110011111110111100101100100011111110111100101111100111101101011110 3f8eca3f8ebdbca33fbcb23fbcbe7d3f8eca3f8ebdbca33fbcb23fbcbe7b5e
UTF-8 ハス治鴫湿}ハス治鴫湿{^ 111011101000010110011111111011111011111010001010111011101000010010001001111011111011110110111101111001101011001010111011111011101000000110110100111010011011010010101011111011101000000110110011111001101011100110111111011111011110111010000101100111111110111110111110100010101110111010000100100010011110111110111101101111011110011010110010101110111110111010000001101101001110100110110100101010111110111010000001101100111110011010111001101111110111101101011110 ee859fefbe8aee8489efbdbde6b2bbee81b4e9b4abee81b3e6b9bf7dee859fefbe8aee8489efbdbde6b2bbee81b4e9b4abee81b3e6b9bf7b5e
UHC ????治????}????治????{^ 0011111100111111001111110011111111110110101111010011111100111111001111110011111101111101001111110011111100111111001111111111011010111101001111110011111100111111001111110111101101011110 3f3f3f3ff6bd3f3f3f3f7d3f3f3f3ff6bd3f3f3f3f7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)