To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嗚?????癒⑦?畑??嗚?????癒⑦?畑??B 100110100110101000111111001111110011111100111111001111111001011011111100100001110100011000111111100101001010100000111111001111111001101001101010001111110011111100111111001111110011111110010110111111001000011101000110001111111001010010101000001111110011111101000010 9a6a3f3f3f3f3f96fc87463f94a83f3f9a6a3f3f3f3f3f96fc87463f94a83f3f42
EUC-JP 嗚?????癒??畑??嗚?????癒??畑??B 11010011110010110011111100111111001111110011111100111111110011001111111000111111001111111100100010101010001111110011111111010011110010110011111100111111001111110011111100111111110011001111111000111111001111111100100010101010001111110011111101000010 d3cb3f3f3f3f3fccfe3f3fc8aa3f3fd3cb3f3f3f3f3fccfe3f3fc8aa3f3f42
UTF-8 嗚삳떧痢볢뿥癒⑦룙畑밸샊嗚삳떧痢볢뿥癒⑦룙畑밸샊B 11100101100101111001101011101100100000101011001111101011100101101010011111101111101001111010010111101011101100111010001011101011101111111010010111100111100110011001001011100010100100011010011011101011101000111001100111100111100101011001000111101011101100001011100011101100100000111000101011100101100101111001101011101100100000101011001111101011100101101010011111101111101001111010010111101011101100111010001011101011101111111010010111100111100110011001001011100010100100011010011011101011101000111001100111100111100101011001000111101011101100001011100011101100100000111000101001000010 e5979aec82b3eb96a7efa7a5ebb3a2ebbfa5e79992e291a6eba399e79591ebb0b8ec838ae5979aec82b3eb96a7efa7a5ebb3a2ebbfa5e79992e291a6eba399e79591ebb0b8ec838a42
UHC 嗚삳떧痢볢뿥癒⑦룙畑밸샊嗚삳떧痢볢뿥癒⑦룙畑밸샊B 11100111111100001011101111101011100010111011101011101100101110001001001111101000100101111010010111101011101010001010100011101101100011111001010111101111101001011011100111101011100110001011100111100111111100001011101111101011100010111011101011101100101110001001001111101000100101111010010111101011101010001010100011101101100011111001010111101111101001011011100111101011100110001011100101000010 e7f0bbeb8bbaecb893e897a5eba8a8ed8f95efa5b9eb98b9e7f0bbeb8bbaecb893e897a5eba8a8ed8f95efa5b9eb98b942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)