To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????Ð???n}?????Ð???n{^ 0011111100111111001111110011111100111111110100000011111100111111001111110110111001111101001111110011111100111111001111110011111111010000001111110011111100111111011011100111101101011110 3f3f3f3f3fd03f3f3f6e7d3f3f3f3f3fd03f3f3f6e7b5e
SJIS-WIN 矮??訝??岳??n}矮??訝??岳??n{^ 1110000111100010001111110011111111100110011000100011111100111111100010100111100000111111001111110110111001111101111000011110001000111111001111111110011001100010001111110011111110001010011110000011111100111111011011100111101101011110 e1e23f3fe6623f3f8a783f3f6e7de1e23f3fe6623f3f8a783f3f6e7b5e
EUC-JP 矮??訝??岳??n}矮??訝??岳??n{^ 1110001011100100001111110011111111101011110000110011111100111111101100111101100100111111001111110110111001111101111000101110010000111111001111111110101111000011001111110011111110110011110110010011111100111111011011100111101101011110 e2e43f3febc33f3fb3d93f3f6e7de2e43f3febc33f3fb3d93f3f6e7b5e
UTF-8 矮욅뼯訝띰Ð岳뜹돺n}矮욅뼯訝띰Ð岳뜹돺n{^ 111001111001111110101110111011001001101010000101111010111011110010101111111010001010100010011101111010111001110110110000110000111001000011100101101100101011001111101011100111001011100111101011100011111011101001101110011111011110011110011111101011101110110010011010100001011110101110111100101011111110100010101000100111011110101110011101101100001100001110010000111001011011001010110011111010111001110010111001111010111000111110111010011011100111101101011110 e79faeec9a85ebbcafe8a89deb9db0c390e5b2b3eb9cb9eb8fba6e7de79faeec9a85ebbcafe8a89deb9db0c390e5b2b3eb9cb9eb8fba6e7b5e
UHC 矮욅뼯訝띰Ð岳뜹돺n}矮욅뼯訝띰Ð岳뜹돺n{^ 1110100011100001100111101110011110010110101100101110010010111000101101101110111110101000101000101110010010111111101101101110010110001001101111010110111001111101111010001110000110011110111001111001011010110010111001001011100010110110111011111010100010100010111001001011111110110110111001011000100110111101011011100111101101011110 e8e19ee796b2e4b8b6efa8a2e4bfb6e589bd6e7de8e19ee796b2e4b8b6efa8a2e4bfb6e589bd6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)