To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???儀??濡??畑??濡?????儀?Ⅹ^ 00111111001111110011111110001011010101100011111100111111100101000100011100111111001111111001010010101000001111110011111110010100010001110011111100111111001111110011111100111111100010110101011000111111100001110101110101011110 3f3f3f8b563f3f94473f3f94a83f3f94473f3f3f3f3f8b563f875d5e
EUC-JP ???儀??濡??畑??濡?????儀??^ 001111110011111100111111101101011011011100111111001111111100011110101000001111110011111111001000101010100011111100111111110001111010100000111111001111110011111100111111001111111011010110110111001111110011111101011110 3f3f3fb5b73f3fc7a83f3fc8aa3f3fc7a83f3f3f3f3fb5b73f3f5e
UTF-8 溜딅줇儀붹콨濡쏆뮅畑밸젇濡쏆뮅黎앸줇儀뷂Ⅹ^ 11101111101001111000101111101011100101001000010111101100101001001000011111100101100001001000000011101011101101101011100111101100101111011010100011100110101111111010000111101100100011111000011011101011101011101000010111100111100101011001000111101011101100001011100011101100101000001000011111100110101111111010000111101100100011111000011011101011101011101000010111101111101001101000100111101100100101011011100011101100101001001000011111100101100001001000000011101011101101111000001011100010100001011010100101011110 efa78beb9485eca487e58480ebb6b9ecbda8e6bfa1ec8f86ebae85e79591ebb0b8eca087e6bfa1ec8f86ebae85efa689ec95b8eca487e58480ebb782e285a95e
UHC 溜딅줇儀붹콨濡쏆뮅畑밸젇濡쏆뮅黎앸줇儀뷂Ⅹ^ 11101010111111101000101011101011101000011001101111101011111100001001010011100110101100011001110111101011101000011001101111101100100100101001010011101111101001011011100111101011101000001000101011101011101000011001101111101100100100101001010011100110101100011001110111101011101000011001101111101011111100001001010011101111101001011011100101011110 eafe8aeba19bebf094e6b19deba19bec9294efa5b9eba08aeba19bec9294e6b19deba19bebf094efa5b95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)