What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????O | 00111111001111110011111100111111001111110011111100111111001111110011111101001111 | 3f3f3f3f3f3f3f3f3f4f |
| SJIS-WIN | 筌≪?泣??遺??O | 1110001010100011100000011110000100111111100010111000001100111111001111111000100011100010001111110011111101001111 | e2a381e13f8b833f3f88e23f3f4f |
| EUC-JP | 筌≪?泣??遺??O | 1110010010100101101000101110001100111111101101011110001100111111001111111011000011100100001111110011111101001111 | e4a5a2e33fb5e33f3fb0e43f3f4f |
| UTF-8 | 筌≪눛泣놅㎖遺우퐠O | 11100111101011011000110011100010100010011010101011101011100010001001101111100110101100111010001111101011100001101000010111100011100011101001011011101001100000011011101011101100100110101011000011101101100100001010000001001111 | e7ad8ce289aaeb889be6b3a3eb8685e38e96e981baec9ab0ed90a04f |
| UHC | 筌≪눛泣놅㎖遺우퐠O | 11101111101001111010000111101100100001111011001111101011111010001000011011101111101001111010001011101011101101101011111111101100101111011000100101001111 | efa7a1ec87b3ebe886efa7a2ebb6bfecbd894f |

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
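
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function names `encode_row` and `encode_table` are made up for illustration, and pairing the page's labels with Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. It also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), as the 3f bytes in the table suggest. Python's mapping tables may differ from the page's for some characters.

```python
# Page label -> Python codec name. This pairing is an assumption:
# cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("筌≪눛泣놅㎖遺우퐠O").items():
        print(label, bits, hexstr)
```

Each byte is formatted as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.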