To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?什????什??や^ 0011111110001111010110010011111100111111001111110011111110001111010110010011111100111111100000101110001001011110 3f8f593f3f3f3f8f593f3f82e25e
EUC-JP ?什??ʼn?什??や^ 00111111101111011011101000111111001111111000111110101001110010100011111110111101101110100011111100111111101001001110010001011110 3fbdba3f3f8fa9ca3fbdba3f3fa4e45e
UTF-8 룶什룫혧ʼn룶什룫혧や^ 111010111010001110110110111001001011101110000000111010111010001110101011111011011001100010100111110001011000100111101011101000111011011011100100101110111000000011101011101000111010101111101101100110001010011111100011100000101000010001011110 eba3b6e4bb80eba3abed98a7c589eba3b6e4bb80eba3abed98a7e382845e
UHC 룶什룫혧ʼn룶什룫혧や^ 100011111010101111100100101001111000111110100010110000101000111110101001101100001000111110101011111001001010011110001111101000101100001010001111101010101110010001011110 8fabe4a78fa2c28fa9b08fabe4a78fa2c28faae45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)