What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???一??醫?????悠??蟻?????受? 00111111001111110011111110001000111010100011111100111111111001111100111000111111001111110011111100111111001111111001011101001001001111110011111110001011011000010011111100111111001111110011111100111111100011101111001100111111 3f3f3f88ea3f3fe7ce3f3f3f3f3f97493f3f8b613f3f3f3f3f8ef33f
EUC-JP ???一??醫?????悠??蟻?????受? 00111111001111110011111110110000111011000011111100111111111011101101000000111111001111110011111100111111001111111100110110101010001111110011111110110101110000100011111100111111001111110011111100111111101111001111010100111111 3f3f3fb0ec3f3feed03f3f3f3f3fcdaa3f3fb5c23f3f3f3f3fbcf53f
UTF-8 麗몃쓹一배굜醫귣룆列띕강悠낉쭚蟻쏇맰麗몃쓹受텯 111011111010011010001000111010111010101010000011111011001001001110111001111001001011100010000000111010111011000010110000111010101011010110011100111010011000011010101011111010101011011110100011111010111010001110000110111011111010011010011100111010111001110110010101111010101011000010010101111001101000001010100000111010111000001010001001111011001010110110011010111010001001111110111011111011001000111110000111111010111010011110110000111011111010011010001000111010111010101010000011111011001001001110111001111001011000111110010111111011011000010110101111 efa688ebaa83ec93b9e4b880ebb0b0eab59ce986abeab7a3eba386efa69ceb9d95eab095e682a0eb8289ecad9ae89fbbec8f87eba7b0efa688ebaa83ec93b9e58f97ed85af
UHC 麗몃쓹一배굜醫귣룆列띕강悠낉쭚蟻쏇맰麗몃쓹受텯 11100110101100001011100011101011100111011001010111101100111010011011100111101000100000101000010011101100101000101000001011101011100011111000010111100110111010101011011011101011101100001010110111101010111011011000010111101111101001111001000011101011111111001001101111101101100100001011011111100110101100001011100011101011100111011001010111100001111101001011011101000010 e6b0b8eb9d95ece9b9e88284eca282eb8f85e6eab6ebb0adeaed85efa790ebfc9bed90b7e6b0b8eb9d95e1f4b742

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
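
The same kind of table can be reproduced offline. The following is a minimal Python sketch, not necessarily how this page computes its results. The mapping from the charset labels above to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the names CODECS, encode_row, and encode_table are hypothetical helpers introduced only for illustration. Characters that a charset cannot represent are replaced with "?" (3f), as in the table.

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: these codecs are close equivalents of the page's charsets.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in encode_table("一").items():
        print(label, shown, bits, hexa)
```

For the single character 一, this sketch gives e4b880 in UTF-8, 88ea in SJIS-WIN, b0ec in EUC-JP, and 3f in ISO-8859-1, which agrees with the corresponding bytes in the table above.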