To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??溢??節??瑤??徇?Ⅷ音⑤?魚 10011110111101000011111100111111100010001110110000111111001111111001000011011111001111110011111111101010101000100011111100111111100111000110110100111111100001110101101110001001101110011000011101000100001111111000101110011011 9ef43f3f88ec3f3f90df3f3feaa23f3f9c6d3f875b89b987443f8b9b
EUC-JP 橈??溢??節??瑤??徇??音??魚 1101110011110110001111110011111110110000111011100011111100111111110000001110000100111111001111111111010010100100001111110011111111010111110011100011111100111111101100101011101100111111001111111011010111111011 dcf63f3fb0ee3f3fc0e13f3ff4a43f3fd7ce3f3fb2bb3f3fb5fb
UTF-8 橈볥씛溢€꼷節낇맪瑤뗭슜徇랃Ⅷ音⑤궚魚 111001101010100110001000111010111011001110100101111011001001010010011011111001101011101010100010111000101000001010101100111010101011110010110111111001111010111110000000111010111000001010000111111010111010011110101010111001111001000110100100111010111001011110101101111011001000101010011100111001011011111010000111111010111001111010000011111000101000010110100111111010011001111110110011111000101001000110100100111010101011011010011010111010011010110110011010 e6a988ebb3a5ec949be6baa2e282aceabcb7e7af80eb8287eba7aae791a4eb97adec8a9ce5be87eb9e83e285a7e99fb3e291a4eab69ae9ad9a
UHC 橈볥씛溢€꼷節낇맪瑤뗭슜徇랃Ⅷ音⑤궚魚 1110100011111010100100111110101110011101101100001110110011101110101000101110011010000100100011111110111110111101100001011110110110010000101100101110100011111101100010111110110010011010101010011110001011011111100011011110111110100101101101111110101111100101101010001110101110000010101011111110010111100000 e8fa93eb9db0eceea2e6848fefbd85ed90b2e8fd8bec9aa9e2df8defa5b7ebe5a8eb82afe5e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)