To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 玉??違?????[玉??違?????[^ 10001011110010100011111100111111100010001110000100111111001111110011111100111111001111110101101110001011110010100011111100111111100010001110000100111111001111110011111100111111001111110101101101011110 8bca3f3f88e13f3f3f3f3f5b8bca3f3f88e13f3f3f3f3f5b5e
EUC-JP 玉??違??洹??[玉??違??洹??[^ 1011011011001100001111110011111110110000111000110011111100111111100011111100011110111010001111110011111101011011101101101100110000111111001111111011000011100011001111110011111110001111110001111011101000111111001111110101101101011110 b6cc3f3fb0e33f3f8fc7ba3f3f5bb6cc3f3fb0e33f3f8fc7ba3f3f5b5e
UTF-8 玉좉꼍違방떤洹잙젚[玉좉꼍違방떤洹잙젚[^ 111001111000111010001001111011001010001010001001111010101011110010001101111010011000000110010101111010111011000010101001111010111001011010100100111001101011010010111001111011001001111010011001111011001010000010011010010110111110011110001110100010011110110010100010100010011110101010111100100011011110100110000001100101011110101110110000101010011110101110010110101001001110011010110100101110011110110010011110100110011110110010100000100110100101101101011110 e78e89eca289eabc8de98195ebb0a9eb96a4e6b4b9ec9e99eca09a5be78e89eca289eabc8de98195ebb0a9eb96a4e6b4b9ec9e99eca09a5b5e
UHC 玉좉꼍違방떤洹잙젚[玉좉꼍違방떤洹잙젚[^ 111010001010110010100000111010101011001010111101111010101101111010111001111001101011011010110010111010101011011110011111111010111010000010010110010110111110100010101100101000001110101010110010101111011110101011011110101110011110011010110110101100101110101010110111100111111110101110100000100101100101101101011110 e8aca0eab2bdeadeb9e6b6b2eab79feba0965be8aca0eab2bdeadeb9e6b6b2eab79feba0965b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)