To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??肉??淫??筌??誼??乙??筌?? 11100010101000110011111100111111100100111111011100111111001111111000100011111010001111110011111111100010101000110011111100111111100010110110001000111111001111111000100110110011001111110011111111100010101000110011111100111111 e2a33f3f93f73f3f88fa3f3fe2a33f3f8b623f3f89b33f3fe2a33f3f
EUC-JP 筌??肉??淫??筌??誼??乙??筌?? 11100100101001010011111100111111110001101111100100111111001111111011000011111100001111110011111111100100101001010011111100111111101101011100001100111111001111111011001010110101001111110011111111100100101001010011111100111111 e4a53f3fc6f93f3fb0fc3f3fe4a53f3fb5c33f3fb2b53f3fe4a53f3f
UTF-8 筌뗭궏肉쏙쭅淫롮젩筌뗭궠誼됵쭓乙쇈뀆筌뗭벱 111001111010110110001100111010111001011110101101111010101011011010001111111010001000001010001001111011001000111110011001111011001010110110000101111001101011011110101011111010111010000110101110111011001010000010101001111001111010110110001100111010111001011110101101111010101011011010100000111010001010101010111100111010111001000010110101111011001010110110010011111001001011100110011001111011001000011110001000111010111000000010000110111001111010110110001100111010111001011110101101111010111011001010110001 e7ad8ceb97adeab68fe88289ec8f99ecad85e6b7abeba1aeeca0a9e7ad8ceb97adeab6a0e8aabceb90b5ecad93e4b999ec8788eb8086e7ad8ceb97adebb2b1
UHC 筌뗭궏肉쏙쭅淫롮젩筌뗭궠誼됵쭓乙쇈뀆筌뗭벱 111011111010011110001011111011001000001010100101111010111011111110111101111011111010011110000001111010111110001010001110111011001010000010100001111011111010011110001011111011001000001010110011111010111111111010001001111011111010011110001011111010111110000010111100111000111000010110000010111011111010011110001011111011001011101010101001 efa78bec82a5ebbfbdefa781ebe28eeca0a1efa78bec82b3ebfe89efa78bebe0bce38582efa78becbaa9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)