To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 猷??殉?6伊??[猷??殉?6伊??[^ 1001011101010001001111110011111110001111011111010011111110000010010101011000100011001001001111110011111101011011100101110101000100111111001111111000111101111101001111111000001001010101100010001100100100111111001111110101101101011110 97513f3f8f7d3f825588c93f3f5b97513f3f8f7d3f825588c93f3f5b5e
EUC-JP 猷??殉?6伊??[猷??殉?6伊??[^ 1100110110110010001111110011111110111101110111100011111110100011101101101011000011001011001111110011111101011011110011011011001000111111001111111011110111011110001111111010001110110110101100001100101100111111001111110101101101011110 cdb23f3fbdde3fa3b6b0cb3f3f5bcdb23f3fbdde3fa3b6b0cb3f3f5b5e
UTF-8 猷띠쪡殉쏅6伊싥닗[猷띠쪡殉쏅6伊싥닗[^ 111001111000110010110111111010111001110110100000111011001010101010100001111001101010111010001001111011001000111110000101111011111011110010010110111001001011110010001010111011001000101110100101111010111000101110010111010110111110011110001100101101111110101110011101101000001110110010101010101000011110011010101110100010011110110010001111100001011110111110111100100101101110010010111100100010101110110010001011101001011110101110001011100101110101101101011110 e78cb7eb9da0ecaaa1e6ae89ec8f85efbc96e4bc8aec8ba5eb8b975be78cb7eb9da0ecaaa1e6ae89ec8f85efbc96e4bc8aec8ba5eb8b975b5e
UHC 猷띠쪡殉쏅6伊싥닗[猷띠쪡殉쏅6伊싥닗[^ 111010111010001110110110111011001010010110011010111000101110011010011011111010111010001110110110111011001010010110011010111000111000100010011011010110111110101110100011101101101110110010100101100110101110001011100110100110111110101110100011101101101110110010100101100110101110001110001000100110110101101101011110 eba3b6eca59ae2e69beba3b6eca59ae3889b5beba3b6eca59ae2e69beba3b6eca59ae3889b5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)