To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???長????長?^ 00111111001111110011111110010010101101110011111100111111001111110011111110010010101101110011111101011110 3f3f3f92b73f3f3f3f92b73f5e
EUC-JP 獐??長?獐??長?^ 1000111111001011101110100011111100111111110001001011100100111111100011111100101110111010001111110011111111000100101110010011111101011110 8fcbba3f3fc4b93f8fcbba3f3fc4b93f5e
UTF-8 獐곈뱄長쁩獐곈뱄長쁠^ 11100111100011011001000011101010101100111000100011101011101100011000010011101001100101011011011111101100100000011010100111100111100011011001000011101010101100111000100011101011101100011000010011101001100101011011011111101100100000011010000001011110 e78d90eab388ebb184e995b7ec81a9e78d90eab388ebb184e995b7ec81a05e
UHC 獐곈뱄長쁩獐곈뱄長쁠^ 111011011110111110110000111010011011100111101111111011011111111010111011110111101110110111101111101100001110100110111001111011111110110111111110101110111101110001011110 edefb0e9b9efedfebbdeedefb0e9b9efedfebbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)