What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 畑??乳??怨??押る?裕??蹂??悠?B 1001010010101000001111110011111110010011111110110011111100111111100010011000010100111111001111111000100110011111100000101110100100111111100101110101010000111111001111111110011011111000001111110011111110010111010010010011111101000010 94a83f3f93fb3f3f89853f3f899f82e93f97543f3fe6f83f3f97493f42
EUC-JP 畑??乳??怨??押る?裕??蹂??悠?B 1100100010101010001111110011111111000110111111010011111100111111101100011110010100111111001111111011001010100001101001001110101100111111110011011011010100111111001111111110110011111010001111110011111111001101101010100011111101000010 c8aa3f3fc6fd3f3fb1e53f3fb2a1a4eb3fcdb53f3fecfa3f3fcdaa3f42
UTF-8 畑밴퉭乳득끽怨⑹꽑押る굟裕녽뇹蹂껊씮悠틆B 11100111100101011001000111101011101100001011010011101101100010011010110111100100101110011011001111101011100100111001110111101011100000011011110111100110100000001010100011100010100100011011100111101010101111011001000111100110100010101011110011100011100000101000101111101010101101011001111111101000101000111001010111101011100001011011110111101011100001111011100111101000101110011000001011101010101110111000101011101100100101001010111011100110100000101010000011101101100010111000011001000010 e79591ebb0b4ed89ade4b9b3eb939deb81bde680a8e291b9eabd91e68abce3828beab59fe8a395eb85bdeb87b9e8b982eabb8aec94aee682a0ed8b8642
UHC 畑밴퉭乳득끽怨⑹꽑押る굟裕녽뇹蹂껊씮悠틆B 1110111110100101101110011110101010111001100001011110101011100001101101011110011010110011101000111110101010110011101010011110110010000100101000001110010011100011101010101110101110000010100001111110101110101110100001101110100110110100101001101110101110110011100000111110101110011101101111111110101011101101101110100111001001000010 efa5b9eab985eae1b5e6b3a3eab3a9ec84a0e4e3aaeb8287ebae86e9b4a6ebb383eb9dbfeaedba7242

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
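
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption, as is the helper name `encode_row`. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u7551B"  # U+7551 (the first character in the table) followed by "B"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

Each output line corresponds to one row of the table: the charset label, the bit string in binary, and the same bytes in hexadecimal.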