To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 猷?┰循?5鼇γ?猷?┰循?5鼇γ?^ 10010111010100010011111110000100101110111000111101111010001111111000001001010100111010101000011110000011110000010011111110010111010100010011111110000100101110111000111101111010001111111000001001010100111010101000011110000011110000010011111101011110 97513f84bb8f7a3f8254ea8783c13f97513f84bb8f7a3f8254ea8783c13f5e
EUC-JP 猷?┰循?5鼇γ?猷?┰循?5鼇γ?^ 11001101101100100011111110101000101111011011110111011011001111111010001110110101111100111110011110100110110000110011111111001101101100100011111110101000101111011011110111011011001111111010001110110101111100111110011110100110110000110011111101011110 cdb23fa8bdbddb3fa3b5f3e7a6c33fcdb23fa8bdbddb3fa3b5f3e7a6c33f5e
UTF-8 猷띠┰循껊5鼇γ굝猷띠┰循껊5鼇γ굝^ 1110011110001100101101111110101110011101101000001110001010010100101100001110010110111110101010101110101010111011100010101110111110111100100101011110100110111100100001111100111010110011111010101011010110011101111001111000110010110111111010111001110110100000111000101001010010110000111001011011111010101010111010101011101110001010111011111011110010010101111010011011110010000111110011101011001111101010101101011001110101011110 e78cb7eb9da0e294b0e5beaaeabb8aefbc95e9bc87ceb3eab59de78cb7eb9da0e294b0e5beaaeabb8aefbc95e9bc87ceb3eab59d5e
UHC 猷띠┰循껊5鼇γ굝猷띠┰循껊5鼇γ굝^ 11101011101000111011011011101100101001101011110111100010111000001000001111101011101000111011010111101000101010001010010111100011100000101000010111101011101000111011011011101100101001101011110111100010111000001000001111101011101000111011010111101000101010001010010111100011100000101000010101011110 eba3b6eca6bde2e083eba3b5e8a8a5e38285eba3b6eca6bde2e083eba3b5e8a8a5e382855e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)