What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????O 00111111001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f3f3f3f4f
SJIS-WIN 語??誼?Ⅴ儒??O 1000110011101010001111110011111110001011011000100011111110000111010110001000111011110010001111110011111101001111 8cea3f3f8b623f87588ef23f3f4f
EUC-JP 語??誼??儒??O 10111000111011000011111100111111101101011100001100111111001111111011110011110100001111110011111101001111 b8ec3f3fb5c33f3fbcf43f3f4f
UTF-8 語뤴뫖誼믭Ⅴ儒몄졋O 11101000101010101001111011101011101001001011010011101011101010111001011011101000101010101011110011101011101011111010110111100010100001011010010011100101100001001001001011101011101010101000010011101100101000011000101101001111 e8aa9eeba4b4ebab96e8aabcebafade285a4e58492ebaa84eca18b4f
UHC 語뤴뫖誼믭Ⅴ儒몄졋O 11100101110111101000111111100010100100011011100011101011111111101001001011101111101001011011010011101010111000111011100011101100101000001011101001001111 e5de8fe291b8ebfe92efa5b4eae3b8eca0ba4f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that cannot be represented in a charset appears as "?" (0x3f).
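
The conversion shown in the table above can be approximated with a short Python sketch. This is not the tool's own implementation: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents, and the helper names `encode_row` and `encode_table` are hypothetical. Unencodable characters are replaced with "?" via `errors="replace"`, which mirrors the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("語O").items():
        print(name, bits, hex_string)
```

Edge cases may differ between Python's codecs and the tool's own conversion tables, so treat the output as a cross-check, not a reference.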