To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???q?????? 00111111001111110011111101110001001111110011111100111111001111110011111100111111 3f3f3f713f3f3f3f3f3f
SJIS-WIN ???q橈??要?? 001111110011111100111111011100011001111011110100001111110011111110010111011101100011111100111111 3f3f3f719ef43f3f97763f3f
EUC-JP 旿??q橈??要?? 1000111111000001111101000011111100111111011100011101110011110110001111110011111111001101110101110011111100111111 8fc1f43f3f71dcf63f3fcdd73f3f
UTF-8 旿울쉘q橈롳쉼要뺝영 11100110100101111011111111101100100110101011100011101100100010011001100001110001111001101010100110001000111010111010000110110011111011001000100110111100111010001010011010000001111010111011101010011101111011001001100010000001 e697bfec9ab8ec899871e6a988eba1b3ec89bce8a681ebba9dec9881
UHC 旿울쉘q橈롳쉼要뺝영 11100111111110101011111111101111101111011010100101110001111010001111101010001110111011111011110110110000111010011010100110010101111001011011111110110101 e7fabfefbda971e8fa8eefbdb0e9a995e5bfb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)