To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????薪?薪?旭???????薪?薪?旭??^ 00111111001111110011111100111111001111111001000001100100001111111001000001100100001111111000100010101110001111110011111100111111001111110011111100111111001111111001000001100100001111111001000001100100001111111000100010101110001111110011111101011110 3f3f3f3f3f90643f90643f88ae3f3f3f3f3f3f3f90643f90643f88ae3f3f5e
EUC-JP ?????薪?薪?旭???????薪?薪?旭??^ 00111111001111110011111100111111001111111011111111000101001111111011111111000101001111111011000010110000001111110011111100111111001111110011111100111111001111111011111111000101001111111011111111000101001111111011000010110000001111110011111101011110 3f3f3f3f3fbfc53fbfc53fb0b03f3f3f3f3f3f3fbfc53fbfc53fb0b03f3f5e
UTF-8 쒀렣쑹렟쒔薪렖薪렖旭렖렜쒀렣쑹렟쒔薪렖薪렖旭렖렜^ 11101100100100101000000011101011101000001010001111101100100100011011100111101011101000001001111111101100100100101001010011101000100101101010101011101011101000001001011011101000100101101010101011101011101000001001011011100110100101111010110111101011101000001001011011101011101000001001110011101100100100101000000011101011101000001010001111101100100100011011100111101011101000001001111111101100100100101001010011101000100101101010101011101011101000001001011011101000100101101010101011101011101000001001011011100110100101111010110111101011101000001001011011101011101000001001110001011110 ec9280eba0a3ec91b9eba09fec9294e896aaeba096e896aaeba096e697adeba096eba09cec9280eba0a3ec91b9eba09fec9294e896aaeba096e896aaeba096e697adeba096eba09c5e
UHC 쒀렣쑹렟쒔薪렖薪렖旭렖렜쒀렣쑹렟쒔薪렖薪렖旭렖렜^ 10111110101011001000111010110100101111101010101110001110101100001011111010101101111000111110111110001110101010111110001111101111100011101010101111101001111011111000111010101011100011101010111010111110101011001000111010110100101111101010101110001110101100001011111010101101111000111110111110001110101010111110001111101111100011101010101111101001111011111000111010101011100011101010111001011110 beac8eb4beab8eb0beade3ef8eabe3ef8eabe9ef8eab8eaebeac8eb4beab8eb0beade3ef8eabe3ef8eabe9ef8eab8eae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)