To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????節??汚 001111110011111100111111001111110011111100111111100100001101111100111111001111111000100110011000 3f3f3f3f3f3f90df3f3f8998
EUC-JP ??????節??汚 001111110011111100111111001111110011111100111111110000001110000100111111001111111011000111111000 3f3f3f3f3f3fc0e13f3fb1f8
UTF-8 咽됱쥒璘뗥퐲節꾪벃汚 111011111010011010011110111010111001000010110001111011001010010110010010111011111010011110101111111010111001011110100101111011011001000010110010111001111010111110000000111010101011111010101010111010111011001010000011111001101011000110011010 efa69eeb90b1eca592efa7afeb97a5ed90b2e7af80eabeaaebb283e6b19a
UHC 咽됱쥒璘뗥퐲節꾪벃汚 1110011011101100100010011110110010100010100010011110110011011110100010111110010110111101100110111110111110111101100001001110110110010011101010011110011111111101 e6ec89eca289ecde8be5bd9befbd84ed93a9e7fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)