To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?寒漿?枇?翹?居^ 001111111000101010100110100111111111011100111111100101001111100000111111111000111100100100111111100010111000111101011110 3f8aa69ff73f94f83fe3c93f8b8f5e
EUC-JP ?寒漿?枇?翹?居^ 001111111011010010101000110111101111100100111111110010001111101000111111111001101100101100111111101101011110111101011110 3fb4a8def93fc8fa3fe6cb3fb5ef5e
UTF-8 뤋寒漿㎂枇샘翹렒居^ 11101011101001001000101111100101101011111001001011100110101111001011111111100011100011101000001011100110100111101000011111101100100000111001100011100111101111111011100111101011101000001001001011100101101100011000010101011110 eba48be5af92e6bcbfe38e82e69e87ec8398e7bfb9eba092e5b1855e
UHC 뤋寒漿㎂枇샘翹렒居^ 10001111101110111111100111001110111011011110110010100111110010111101110111101101101110111111100111001110111011101000111010100111110010111101110001011110 8fbbf9ceedeca7cbddedbbf9ceee8ea7cbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)