To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??誼??釉??筌??愉??儀??筌?? 11100010101000110011111100111111100010110110001000111111001111111110011111010110001111110011111111100010101000110011111100111111100101101111100100111111001111111000101101010110001111110011111111100010101000110011111100111111 e2a33f3f8b623f3fe7d63f3fe2a33f3f96f93f3f8b563f3fe2a33f3f
EUC-JP 筌??誼??釉??筌??愉??儀??筌?? 11100100101001010011111100111111101101011100001100111111001111111110111011011000001111110011111111100100101001010011111100111111110011001111101100111111001111111011010110110111001111110011111111100100101001010011111100111111 e4a53f3fb5c33f3feed83f3fe4a53f3fccfb3f3fb5b73f3fe4a53f3f
UTF-8 筌뗭궠誼랃쭓釉띿젷筌뗫툖愉쒎쳞儀뺣퓡筌뗭왇 111001111010110110001100111010111001011110101101111010101011011010100000111010001010101010111100111010111001111010000011111011001010110110010011111010011000011110001001111010111001110110111111111011001010000010110111111001111010110110001100111010111001011110101011111011011000100010010110111001101000010010001001111011001001001010001110111011001011001110011110111001011000010010000000111010111011101010100011111011011001001110100001111001111010110110001100111010111001011110101101111011001001100110000111 e7ad8ceb97adeab6a0e8aabceb9e83ecad93e98789eb9dbfeca0b7e7ad8ceb97abed8896e68489ec928eecb39ee58480ebbaa3ed93a1e7ad8ceb97adec9987
UHC 筌뗭궠誼랃쭓釉띿젷筌뗫툖愉쒎쳞儀뺣퓡筌뗭왇 111011111010011110001011111011001000001010110011111010111111111010001101111011111010011110001011111010111011100010001101111011001010000010101011111011111010011110001011111010111011100010001101111010101111000010011100111001011010101110000100111010111111000010010101111010111011111110001010111011111010011110001011111011001001111010111001 efa78bec82b3ebfe8defa78bebb88deca0abefa78bebb88deaf09ce5ab84ebf095ebbf8aefa78bec9eb9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)