What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Oh?????????O 001111110011111100111111001111110011111100111111001111110011111100111111010011110110100000111111001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f3f3f3f4f683f3f3f3f3f3f3f3f3f4f
SJIS-WIN ???厭?????Oh???厭?????O 0011111100111111001111111000100101111101001111110011111100111111001111110011111101001111011010000011111100111111001111111000100101111101001111110011111100111111001111110011111101001111 3f3f3f897d3f3f3f3f3f4f683f3f3f897d3f3f3f3f3f4f
EUC-JP 轝??厭?????Oh轝??厭?????O 100011111110000110101010001111110011111110110001110111100011111100111111001111110011111100111111010011110110100010001111111000011010101000111111001111111011000111011110001111110011111100111111001111110011111101001111 8fe1aa3f3fb1de3f3f3f3f3f4f688fe1aa3f3fb1de3f3f3f3f3f4f
UTF-8 轝뚮젶厭묒뼐溜㏓젪Oh轝뚮젶厭묒뼐溜㏓젪O 111010001011110110011101111010111001101010101110111011001010000010110110111001011000111010101101111010111010110010010010111010111011110010010000111011111010011110001011111000111000111110010011111011001010000010101010010011110110100011101000101111011001110111101011100110101010111011101100101000001011011011100101100011101010110111101011101011001001001011101011101111001001000011101111101001111000101111100011100011111001001111101100101000001010101001001111 e8bd9deb9aaeeca0b6e58eadebac92ebbc90efa78be38f93eca0aa4f68e8bd9deb9aaeeca0b6e58eadebac92ebbc90efa78be38f93eca0aa4f
UHC 轝뚮젶厭묒뼐溜㏓젪Oh轝뚮젶厭묒뼐溜㏓젪O 111001101010110010001100111010111010000010101010111001101111010010010001111011001001011010011000111010101111111010100111111010111010000010100010010011110110100011100110101011001000110011101011101000001010101011100110111101001001000111101100100101101001100011101010111111101010011111101011101000001010001001001111 e6ac8ceba0aae6f491ec9698eafea7eba0a24f68e6ac8ceba0aae6f491ec9698eafea7eba0a24f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
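
The same kind of table can be reproduced offline. The following is a minimal Python sketch, not the converter's actual implementation (which is not shown here). The function name `encode_report` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the table above.

```python
# Hypothetical helper: encode a string in several charsets and report
# the resulting bytes as a binary bit string and as hexadecimal.

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (bit_string, hex_string)} for each charset."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        report[label] = (bits, data.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexa) in encode_report("Oh").items():
        print(label, bits, hexa)
```

For pure ASCII input such as "Oh", every charset listed here yields the same bytes (4f 68), which matches the 4f68 that appears in each row of the table.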