What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 絶???③?樟?????樟??凹???①?^ 10010000111000100011111100111111001111111000011101000010001111111000111110111110001111110011111100111111001111110011111110001111101111100011111100111111100010011001101000111111001111110011111110000111010000000011111101011110 90e23f3f3f87423f8fbe3f3f3f3f3f8fbe3f3f899a3f3f3f87403f5e
EUC-JP 絶?????樟?????樟??凹?????^ 1100000011100100001111110011111100111111001111110011111110111110110000000011111100111111001111110011111100111111101111101100000000111111001111111011000111111010001111110011111100111111001111110011111101011110 c0e43f3f3f3f3fbec03f3f3f3f3fbec03f3fb1fa3f3f3f3f3f5e
UTF-8 絶욐땳禮③똼樟됭겏鍊깍풛樟됭겏凹귧땳禮①뒻^ 11100111101101011011011011101100100110101001000011101011100101011011001111101111101001101011011011100010100100011010001011101011100110001011110011100110101010001001111111101011100100001010110111101010101100101000111111101111101001101001101111101010101110011000110111101101100100101001101111100110101010001001111111101011100100001010110111101010101100101000111111100101100001111011100111101010101101111010011111101011100101011011001111101111101001101011011011100010100100011010000011101011100100101011101101011110 e7b5b6ec9a90eb95b3efa6b6e291a2eb98bce6a89feb90adeab28fefa69beab98ded929be6a89feb90adeab28fe587b9eab7a7eb95b3efa6b6e291a0eb92bb5e
UHC 絶욐땳禮③똼樟됭겏鍊깍풛樟됭겏凹귧땳禮①뒻^ 11101111101111101001111011101110100010111000100111100111110111111010100011101001100011001000001011101101111010011000100111101000100000011010100011100110111010001011000111101111101111101001111011101101111010011000100111101000100000011010100011101000111010101000001011101110100010111000100111100111110111111010100011100111100010101011000101011110 efbe9eee8b89e7dfa8e98c82ede989e881a8e6e8b1efbe9eede989e881a8e8ea82ee8b89e7dfa8e78ab15e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
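
The same kind of conversion can be reproduced offline. The following is a minimal sketch, not the code behind this page: the function name encode_to_bits and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which is assumed to match the "?" entries in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    sample = "絶^"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, bits, hex_string)
```

For example, encode_to_bits("^", "utf-8") returns ("01011110", "5e"), the same trailing byte that appears in every row of the table.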