To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??序げ?仇??莎???序げ?仇??莎や^ 001111110011111110001111100110001000001010110000001111111000101101110111001111110011111111100100101100110011111100111111001111111000111110011000100000101011000000111111100010110111011100111111001111111110010010110011100000101110001001011110 3f3f8f9882b03f8b773f3fe4b33f3f3f8f9882b03f8b773f3fe4b382e25e
EUC-JP ??序げ?仇??莎ʼn??序げ?仇??莎や^ 0011111100111111101111011111100010100100101100100011111110110101110110000011111100111111111010001011010110001111101010011100101000111111001111111011110111111000101001001011001000111111101101011101100000111111001111111110100010110101101001001110010001011110 3f3fbdf8a4b23fb5d83f3fe8b58fa9ca3f3fbdf8a4b23fb5d83f3fe8b5a4e45e
UTF-8 룶깹序げ룶仇룫찼莎ʼn룶깹序げ룶仇룫찼莎や^ 111010111010001110110110111010101011100110111001111001011011101010001111111000111000000110010010111010111010001110110110111001001011101110000111111010111010001110101011111011001011000010111100111010001000111010001110110001011000100111101011101000111011011011101010101110011011100111100101101110101000111111100011100000011001001011101011101000111011011011100100101110111000011111101011101000111010101111101100101100001011110011101000100011101000111011100011100000101000010001011110 eba3b6eab9b9e5ba8fe38192eba3b6e4bb87eba3abecb0bce88e8ec589eba3b6eab9b9e5ba8fe38192eba3b6e4bb87eba3abecb0bce88e8ee382845e
UHC 룶깹序げ룶仇룫찼莎ʼn룶깹序げ룶仇룫찼莎や^ 1000111110101011101100101010000111011111111011011010101010110010100011111010101111001110111110111000111110100010110000111010000111011110111011011010100110110000100011111010101110110010101000011101111111101101101010101011001010001111101010111100111011111011100011111010001011000011101000011101111011101101101010101110010001011110 8fabb2a1dfedaab28fabcefb8fa2c3a1deeda9b08fabb2a1dfedaab28fabcefb8fa2c3a1deedaae45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)