To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セャ者樵ィ篠セュ痔セャ聳セャ者樵ィ篠セュ痔セャ聢^ 1011111010101100100011101101001010001111101111111010100010001110110000101011111010101101100011101010010010111110101011001110001111011110101111101010110010001110110100101000111110111111101010001000111011000010101111101010110110001110101001001011111010101100111000111101110001011110 beac8ed28fbfa88ec2bead8ea4beace3debeac8ed28fbfa88ec2bead8ea4beace3dc5e
EUC-JP セャ者樵ィ篠セュ痔セャ聳セャ者樵ィ篠セュ痔セャ聢^ 10001110101111101000111010101100101111001101010010111110110000011000111010101000101111001100010010001110101111101000111010101101101111001010011010001110101111101000111010101100111001101110000010001110101111101000111010101100101111001101010010111110110000011000111010101000101111001100010010001110101111101000111010101101101111001010011010001110101111101000111010101100111001101101111001011110 8ebe8eacbcd4bec18ea8bcc48ebe8eadbca68ebe8eace6e08ebe8eacbcd4bec18ea8bcc48ebe8eadbca68ebe8eace6de5e
UTF-8 セャ者樵ィ篠セュ痔セャ聳セャ者樵ィ篠セュ痔セャ聢^ 11101111101111011011111011101111101111011010110011101000100000001000010111100110101010001011010111101111101111011010100011100111101011111010000011101111101111011011111011101111101111011010110111100111100101111001010011101111101111011011111011101111101111011010110011101000100000011011001111101111101111011011111011101111101111011010110011101000100000001000010111100110101010001011010111101111101111011010100011100111101011111010000011101111101111011011111011101111101111011010110111100111100101111001010011101111101111011011111011101111101111011010110011101000100000011010001001011110 efbdbeefbdace88085e6a8b5efbda8e7afa0efbdbeefbdade79794efbdbeefbdace881b3efbdbeefbdace88085e6a8b5efbda8e7afa0efbdbeefbdade79794efbdbeefbdace881a25e
UHC ??者樵?篠??痔??聳??者樵?篠??痔???^ 00111111001111111110110110111010111101011010001100111111111000011100011000111111001111111111011011000000001111110011111111101001110001100011111100111111111011011011101011110101101000110011111111100001110001100011111100111111111101101100000000111111001111110011111101011110 3f3fedbaf5a33fe1c63f3ff6c03f3fe9c63f3fedbaf5a33fe1c63f3ff6c03f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)