To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 恁??姨??????ъ????恁??夷???ъ?B 10011100100011000011111100111111100110110100100000111111001111110011111100111111001111110011111110000100100011000011111100111111001111110011111110011100100011000011111100111111100010001100111000111111001111110011111110000100100011000011111101000010 9c8c3f3f9b483f3f3f3f3f3f848c3f3f3f3f9c8c3f3f88ce3f3f3f848c3f42
EUC-JP 恁??姨??????ъ????恁??夷???ъ?B 11010111111011000011111100111111110101011010100100111111001111110011111100111111001111110011111110100111111011000011111100111111001111110011111111010111111011000011111100111111101100001101000000111111001111110011111110100111111011000011111101000010 d7ec3f3fd5a93f3f3f3f3f3fa7ec3f3f3f3fd7ec3f3fb0d03f3f3fa7ec3f42
UTF-8 恁㏃㎣姨붿쭡淋륁콬吏ъ쭦吏뺤㎟恁㏃㎚夷덊삨吏ъ쭡B 1110011010000001100000011110001110001111100000111110001110001110101000111110010110100111101010001110101110110110101111111110110010101101101000011110111110100111101101011110101110100101100000011110110010111101101011001110111110100111100111101101000110001010111011001010110110100110111011111010011110011110111010111011101010100100111000111000111010011111111001101000000110000001111000111000111110000011111000111000111010011010111001011010010010110111111010111000110110001010111011001000001010101000111011111010011110011110110100011000101011101100101011011010000101000010 e68181e38f83e38ea3e5a7a8ebb6bfecada1efa7b5eba581ecbdacefa79ed18aecada6efa79eebbaa4e38e9fe68181e38f83e38e9ae5a4b7eb8d8aec82a8efa79ed18aecada142
UHC 恁㏃㎣姨붿쭡淋륁콬吏ъ쭦吏뺤㎟恁㏃㎚夷덊삨吏ъ쭡B 11101100111101101010011111101100101001111010011111101100101010011001010011101100101001111001011011101100111110001000111111101100101100011010000011101100101001111010110011101100101001111001101011101100101001111001010111101100101001111011000111101100111101101010011111101100101001111010110011101100101010001000100011101101100110001010011111101100101001111010110011101100101001111001011001000010 ecf6a7eca7a7eca994eca796ecf88fecb1a0eca7aceca79aeca795eca7b1ecf6a7eca7aceca888ed98a7eca7aceca79642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)