What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??逾??臾??葉??誼??乙μ?筌?? 1110001010100011001111110011111111100111101001010011111100111111111001000110101100111111001111111001011101110100001111110011111110001011011000100011111100111111100010011011001110000011110010100011111111100010101000110011111100111111 e2a33f3fe7a53f3fe46b3f3f97743f3f8b623f3f89b383ca3fe2a33f3f
EUC-JP 筌??逾??臾??葉??誼??乙μ?筌?? 1110010010100101001111110011111111101110101001110011111100111111111001111100110000111111001111111100110111010101001111110011111110110101110000110011111100111111101100101011010110100110110011000011111111100100101001010011111100111111 e4a53f3feea73f3fe7cc3f3fcdd53f3fb5c33f3fb2b5a6cc3fe4a53f3f
UTF-8 筌뗫툙逾울쫰臾볦뜏葉붾굝誼섓쭓乙μ빼筌뗭텫 1110011110101101100011001110101110010111101010111110110110001000100110011110100110000000101111101110110010011010101110001110110010101011101100001110100010000111101111101110101110110011101001101110101110011100100011111110100010010001100010011110101110110110101111101110101010110101100111011110100010101010101111001110110010000100100100111110110010101101100100111110010010111001100110011100111010111100111010111011100110111100111001111010110110001100111010111001011110101101111011011000010110101011 e7ad8ceb97abed8899e980beec9ab8ecabb0e887beebb3a6eb9c8fe89189ebb6beeab59de8aabcec8493ecad93e4b999cebcebb9bce7ad8ceb97aded85ab
UHC 筌뗫툙逾울쫰臾볦뜏葉붾굝誼섓쭓乙μ빼筌뗭텫 111011111010011110001011111010111011100010010000111010111011010110111111111011111010011010001000111010111010110010010011111011001000110110010010111001111010100010010100111010111000001010000101111010111111111010011000111011111010011110001011111010111110000010100101111011001011101110101001111011111010011110001011111011001011011010011111 efa78bebb890ebb5bfefa688ebac93ec8d92e7a894eb8285ebfe98efa78bebe0a5ecbba9efa78becb69f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
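
A conversion table like the one above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Unencodable characters are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the table, though the page's actual substitution rule is not stated.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for characters the
    # target charset cannot represent.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("あ").items():
        print(name, bits, hex_string)
```

For instance, `encode_row("あ", "utf-8")` gives the hexadecimal string `e38182`, while `encode_row("あ", "iso-8859-1")` gives `3f` because ISO-8859-1 has no code for that character.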