Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????\n}???????\n{^ 001111110011111100111111001111110011111100111111001111110101110001101110011111010011111100111111001111110011111100111111001111110011111101011100011011100111101101011110 3f3f3f3f3f3f3f5c6e7d3f3f3f3f3f3f3f5c6e7b5e
SJIS-WIN 痍???而驀缺\n}痍???而驀缺\n{^ 1110000101110111001111110011111100111111100011101010011111101001011111011110001110011110010111000110111001111101111000010111011100111111001111110011111110001110101001111110100101111101111000111001111001011100011011100111101101011110 e1773f3f3f8ea7e97de39e5c6e7de1773f3f3f8ea7e97de39e5c6e7b5e
EUC-JP 痍?檉?而驀缺\n}痍?檉?而驀缺\n{^ 111000011101100000111111100011111100010110111011001111111011110010101001111100011101111011100101111111100101110001101110011111011110000111011000001111111000111111000101101110110011111110111100101010011111000111011110111001011111111001011100011011100111101101011110 e1d83f8fc5bb3fbca9f1dee5fe5c6e7de1d83f8fc5bb3fbca9f1dee5fe5c6e7b5e
UTF-8 痍렞檉렢而驀缺\n}痍렞檉렢而驀缺\n{^ 11100111100101111000110111101011101000001001111011100110101010101000100111101011101000001010001011101000100000001000110011101001101010011000000011100111101111001011101001011100011011100111110111100111100101111000110111101011101000001001111011100110101010101000100111101011101000001010001011101000100000001000110011101001101010011000000011100111101111001011101001011100011011100111101101011110 e7978deba09ee6aa89eba0a2e8808ce9a980e7bcba5c6e7de7978deba09ee6aa89eba0a2e8808ce9a980e7bcba5c6e7b5e
UHC 痍렞檉렢而驀缺\n}痍렞檉렢而驀缺\n{^ 1110110010110111100011101010111111101111111000001000111010110011111011001011101111011000111010011100110011000000010111000110111001111101111011001011011110001110101011111110111111100000100011101011001111101100101110111101100011101001110011001100000001011100011011100111101101011110 ecb78eafefe08eb3ecbbd8e9ccc05c6e7decb78eafefe08eb3ecbbd8e9ccc05c6e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
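
The same kind of conversion can be sketched in Python. This is a minimal illustration, not the code behind this page. The mapping from the charset labels above to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names to_bits and encode_in_each are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one string of 0s and 1s, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_in_each(text: str) -> dict:
    """Encode text in every charset; return {label: (bit string, hex string)}."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        result[label] = (to_bits(raw), raw.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_each("あ").items():
        print(label, bits, hex_string)
```

Calling encode_in_each with a single character gives one (binary, hexadecimal) pair per charset, in the same column order as the table above.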