To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???毅??諛??????汚??悠?Ⅶ儒??B 0011111100111111001111111000101101000010001111110011111111100110100001110011111100111111001111110011111100111111001111111000100110011000001111110011111110010111010010010011111110000111010110101000111011110010001111110011111101000010 3f3f3f8b423f3fe6873f3f3f3f3f3f89983f3f97493f875a8ef23f3f42
EUC-JP ???毅??諛??????汚??悠??儒??B 00111111001111110011111110110101101000110011111100111111111010111110011100111111001111110011111100111111001111110011111110110001111110000011111100111111110011011010101000111111001111111011110011110100001111110011111101000010 3f3f3fb5a33f3febe73f3f3f3f3f3fb1f83f3fcdaa3f3fbcf43f3f42
UTF-8 樂낅뗄毅뗦룚諛㏐퍢廉띘댄떔汚삳낌悠띰Ⅶ儒밸옩B 11101111101001101011111111101011100000101000010111101011100101111000010011100110101011111000010111101011100101111010011011101011101000111001101011101000101010111001101111100011100011111001000011101101100011011010001011101111101001101010001011101011100111011001100011101011100011001000010011101011100101101001010011100110101100011001101011101100100000101011001111101011100000101000110011100110100000101010000011101011100111011011000011100010100001011010011011100101100001001001001011101011101100001011100011101100100110001010100101000010 efa6bfeb8285eb9784e6af85eb97a6eba39ae8ab9be38f90ed8da2efa6a2eb9d98eb8c84eb9694e6b19aec82b3eb828ce682a0eb9db0e285a6e58492ebb0b8ec98a942
UHC 樂낅뗄毅뗦룚諛㏐퍢廉띘댄떔汚삳낌悠띰Ⅶ儒밸옩B 111010001111100110000101111010111011011010111111111010111111011010001011111001101000111110010110111010111011000010100111111010101011101110011001111001101111010110001101110011101011010011101101100010111010101011100111111111011011101111101011101100111010011011101010111011011011011011101111101001011011011011101010111000111011100111101011100111101010100001000010 e8f985ebb6bfebf68be68f96ebb0a7eabb99e6f58dceb4ed8baae7fdbbebb3a6eaedb6efa5b6eae3b9eb9ea842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)