What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????|???? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110000111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7c3f3f3f3f
SJIS-WIN 筌??誼??釉??筌?????乙??|筌??誼 111000101010001100111111001111111000101101100010001111110011111111100111110101100011111100111111111000101010001100111111001111110011111100111111001111111000100110110011001111110011111101111100111000101010001100111111001111111000101101100010 e2a33f3f8b623f3fe7d63f3fe2a33f3f3f3f3f89b33f3f7ce2a33f3f8b62
EUC-JP 筌??誼??釉??筌?????乙??|筌??誼 111001001010010100111111001111111011010111000011001111110011111111101110110110000011111100111111111001001010010100111111001111110011111100111111001111111011001010110101001111110011111101111100111001001010010100111111001111111011010111000011 e4a53f3fb5c33f3feed83f3fe4a53f3f3f3f3fb2b53f3f7ce4a53f3fb5c3
UTF-8 筌뗭궠誼뉛쭓釉띿죦筌뗫렇麟놅쭓乙녹젃|筌뗭궠誼 11100111101011011000110011101011100101111010110111101010101101101010000011101000101010101011110011101011100010011001101111101100101011011001001111101001100001111000100111101011100111011011111111101100101000111010011011100111101011011000110011101011100101111010101111101011101000001000011111101111101001111011001111101011100001101000010111101100101011011001001111100100101110011001100111101011100001011011100111101100101000001000001101111100111001111010110110001100111010111001011110101101111010101011011010100000111010001010101010111100 e7ad8ceb97adeab6a0e8aabceb899becad93e98789eb9dbfeca3a6e7ad8ceb97abeba087efa7b3eb8685ecad93e4b999eb85b9eca0837ce7ad8ceb97adeab6a0e8aabc
UHC 筌뗭궠誼뉛쭓釉띿죦筌뗫렇麟놅쭓乙녹젃|筌뗭궠誼 111011111010011110001011111011001000001010110011111010111111111010000111111011111010011110001011111010111011100010001101111011001010000110000001111011111010011110001011111010111011011110111000111011001110100010000110111011111010011110001011111010111110000010110011111011001010000010000111011111001110111110100111100010111110110010000010101100111110101111111110 efa78bec82b3ebfe87efa78bebb88deca181efa78bebb7b8ece886efa78bebe0b3eca0877cefa78bec82b3ebfe

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
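
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page; the codec names in CODECS are assumed Python equivalents of the labels in the table (cp932 for SJIS-WIN, cp949 for UHC), and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?", which is why such positions appear as 3f in the hexadecimal column.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("あ").items():
        print(label, bits, hexa)
```

Calling encode_table with any short string yields one (binary, hexadecimal) pair per charset, in the same column order as the table above.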