To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而?怨?而?旭?而?旭?而?爾???凋 10001110101001110011111110001001100001010011111110001110101001110011111110001000101011100011111110001110101001110011111110001000101011100011111110001110101001110011111110001110101000100011111100111111001111111001001010011100 8ea73f89853f8ea73f88ae3f8ea73f88ae3f8ea73f8ea23f3f3f929c
EUC-JP 而?怨?而?旭?而?旭?而?爾???凋 10111100101010010011111110110001111001010011111110111100101010010011111110110000101100000011111110111100101010010011111110110000101100000011111110111100101010010011111110111100101001000011111100111111001111111100001111111100 bca93fb1e53fbca93fb0b03fbca93fb0b03fbca93fbca43f3f3fc3fc
UTF-8 而렲怨렊而렲旭렠而렲旭렠而렲爾잭렫렲凋 111010001000000010001100111010111010000010110010111001101000000010101000111010111010000010001010111010001000000010001100111010111010000010110010111001101001011110101101111010111010000010100000111010001000000010001100111010111010000010110010111001101001011110101101111010111010000010100000111010001000000010001100111010111010000010110010111001111000100010111110111011001001111010101101111010111010000010101011111010111010000010110010111001011000011110001011 e8808ceba0b2e680a8eba08ae8808ceba0b2e697adeba0a0e8808ceba0b2e697adeba0a0e8808ceba0b2e788beec9eadeba0abeba0b2e5878b
UHC 而렲怨렊而렲旭렠而렲旭렠而렲爾잭렫렲凋 1110110010111011100011101011111111101010101100111000111010100001111011001011101110001110101111111110100111101111100011101011000111101100101110111000111010111111111010011110111110001110101100011110110010111011100011101011111111101100101100111100000011101000100011101011100110001110101111111111000010111101 ecbb8ebfeab38ea1ecbb8ebfe9ef8eb1ecbb8ebfe9ef8eb1ecbb8ebfecb3c0e88eb98ebff0bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)