To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????] 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5d
SJIS-WIN 鵝??肄??淫??繹??悠??乙???k?] 1110101001000000001111110011111111100011111001010011111100111111100010001111101000111111001111111110001110001000001111110011111110010111010010010011111100111111100010011011001100111111001111110011111110000010100010110011111101011101 ea403f3fe3e53f3f88fa3f3fe3883f3f97493f3f89b33f3f3f828b3f5d
EUC-JP 鵝??肄??淫??繹??悠??乙???k?] 1111001110100001001111110011111111100110111001110011111100111111101100001111110000111111001111111110010111101000001111110011111111001101101010100011111100111111101100101011010100111111001111110011111110100011111010110011111101011101 f3a13f3fe6e73f3fb0fc3f3fe5e83f3fcdaa3f3fb2b53f3f3fa3eb3f5d
UTF-8 鵝숈뮆肄덃끽淫딅쎗繹먮씮悠긺솾乙노굫力k띃] 11101001101101011001110111101100100010001000100011101011101011101000011011101000100000101000010011101011100011011000001111101011100000011011110111100110101101111010101111101011100101001000010111101100100011101001011111100111101110011011100111101011101010001010111011101100100101001010111011100110100000101010000011101010101110001011101011101100100001101011111011100100101110011001100111101011100001011011100011101010101101011010101111101111101001101000101011101111101111011000101111101011100111011000001101011101 e9b59dec8888ebae86e88284eb8d83eb81bde6b7abeb9485ec8e97e7b9b9eba8aeec94aee682a0eab8baec86bee4b999eb85b8eab5abefa68aefbd8beb9d835d
UHC 鵝숈뮆肄덃끽淫딅쎗繹먮씮悠긺솾乙노굫力k띃] 11100100101111011001100111101100100100101001010111101100101111011000100011100110101100111010001111101011111000101000101011101011100110111011111011100110101110101001000011101011100111011011111111101010111011011011000111100111100110011011001011101011111000001011001111101011100000101001000111100110101100111010001111101011100011011011111001011101 e4bd99ec9295ecbd88e6b3a3ebe28aeb9bbee6ba90eb9dbfeaedb1e799b2ebe0b3eb8291e6b3a3eb8dbe5d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)