To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 怏??意?????濡??淫??濡??濡??^ 10011100100010010011111100111111100010001101001100111111001111110011111100111111001111111001010001000111001111110011111110001000111110100011111100111111100101000100011100111111001111111001010001000111001111110011111101011110 9c893f3f88d33f3f3f3f3f94473f3f88fa3f3f94473f3f94473f3f5e
EUC-JP 怏??意?????濡??淫??濡??濡??^ 11010111111010010011111100111111101100001101010100111111001111110011111100111111001111111100011110101000001111110011111110110000111111000011111100111111110001111010100000111111001111111100011110101000001111110011111101011110 d7e93f3fb0d53f3f3f3f3fc7a83f3fb0fc3f3fc7a83f3fc7a83f3f5e
UTF-8 怏⑹꼫意붾죿溜뤿죭濡쀥뒰淫덉뵒濡쀫젎濡덈죻^ 11100110100000001000111111100010100100011011100111101010101111001010101111100110100001001000111111101011101101101011111011101100101000111011111111101111101001111000101111101011101001001011111111101100101000111010110111100110101111111010000111101100100000001010010111101011100100101011000011100110101101111010101111101011100011011000100111101011101101011001001011100110101111111010000111101100100000001010101111101100101000001000111011100110101111111010000111101011100011011000100011101100101000111011101101011110 e6808fe291b9eabcabe6848febb6beeca3bfefa78beba4bfeca3ade6bfa1ec80a5eb92b0e6b7abeb8d89ebb592e6bfa1ec80abeca08ee6bfa1eb8d88eca3bb5e
UHC 怏⑹꼫意붾죿溜뤿죭濡쀥뒰淫덉뵒濡쀫젎濡덈죻^ 11100100111010001010100111101100100001001000100011101011111100101001010011101011101000011001011111101010111111101000111111101011101000011000100011101011101000011001011111100101100010101010100111101011111000101000100011101100100101001001010011101011101000011001011111101011101000001000111111101011101000011000100011101011101000011001010101011110 e4e8a9ec8488ebf294eba197eafe8feba188eba197e58aa9ebe288ec9494eba197eba08feba188eba1955e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
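
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_to_bits` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed Python codec names for the charsets listed in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits("^", codec)
        print(label, binary, hexadecimal)
```

For the ASCII character `^`, every charset listed yields the single byte 5e (01011110), which matches the last byte of each row in the table.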