To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 玉??節??鈺?? 100010111100101000111111001111111001000011011111001111110011111111111011110001000011111100111111 8bca3f3f90df3f3ffbc43f3f
EUC-JP 玉??節??鈺?? 10110110110011000011111100111111110000001110000100111111001111111000111111100011110101010011111100111111 b6cc3f3fc0e13f3f8fe3d53f3f
UTF-8 玉뽪첁節듸쉿鈺잞슭 111001111000111010001001111010111011110110101010111011001011001010000001111001111010111110000000111010111001001110111000111011001000100110111111111010011000100010111010111011001001111010011110111011001000101010101101 e78e89ebbdaaecb281e7af80eb93b8ec89bfe988baec9e9eec8aad
UHC 玉뽪첁節듸쉿鈺잞슭 111010001010110010010110111001101010101010001110111011111011110110110101111011111011110110110010111010001010110110011111111011111011110110111110 e8ac96e6aa8eefbdb5efbdb2e8ad9fefbdbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)