To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???猷?ぜ???筌??誼ワ┤幽??沃?? 0011111100111111001111111001011101010001001111111000001010111010001111110011111100111111111000101010001100111111001111111000101101100010100000111000111110000100101001111001011101001000001111110011111110010111100000000011111100111111 3f3f3f97513f82ba3f3f3fe2a33f3f8b62838f84a797483f3f97803f3f
EUC-JP ???猷?ぜ洧??筌??誼ワ┤幽??沃?? 00111111001111110011111111001101101100100011111110100100101111001000111111000111101101000011111100111111111001001010010100111111001111111011010111000011101001011110111110101000101010011100110110101001001111110011111111001101111000000011111100111111 3f3f3fcdb23fa4bc8fc7b43f3fe4a53f3fb5c3a5efa8a9cda93f3fcde03f3f
UTF-8 劣믩뗄猷녻ぜ洧븍븸筌먲퐣誼ワ┤幽덉뒧沃쇰쁾 111011111010011010011101111010111010111110101001111010111001011110000100111001111000110010110111111010111000010110111011111000111000000110011100111001101011010010100111111010111011100010001101111010111011100010111000111001111010110110001100111010111010100010110010111011011001000010100011111010001010101010111100111000111000001110101111111000101001010010100100111001011011100110111101111010111000110110001001111010111001001010100111111001101011001010000011111011001000011110110000111011001000000110111110 efa69debafa9eb9784e78cb7eb85bbe3819ce6b4a7ebb88debb8b8e7ad8ceba8b2ed90a3e8aabce383afe294a4e5b9bdeb8d89eb92a7e6b283ec87b0ec81be
UHC 劣믩뗄猷녻ぜ洧븍븸筌먲퐣誼ワ┤幽덉뒧沃쇰쁾 111001101110101110010010111010111011011010111111111010111010001110000110111010001010101010111100111010101111101110111010111010111001010110100001111011111010011110010000111011111011110110001100111010111111111010101011111011111010011010101001111010101110101110001000111011001000101010100010111010001010101010111100111010111001100010000101 e6eb92ebb6bfeba386e8aabceafbbaeb95a1efa790efbd8cebfeabefa6a9eaeb88ec8aa2e8aabceb9885

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)