To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????r[?????????r[^ 0011111100111111001111110011111100111111001111110011111100111111001111110111001001011011001111110011111100111111001111110011111100111111001111110011111100111111011100100101101101011110 3f3f3f3f3f3f3f3f3f725b3f3f3f3f3f3f3f3f3f725b5e
SJIS-WIN ??ゲ??こ??あr[??ゲ??こ??あr[^ 0011111100111111100000110101000100111111001111111000001010110001001111110011111110000010101000000111001001011011001111110011111110000011010100010011111100111111100000101011000100111111001111111000001010100000011100100101101101011110 3f3f83513f3f82b13f3f82a0725b3f3f83513f3f82b13f3f82a0725b5e
EUC-JP ??ゲ??こ??あr[??ゲ??こ??あr[^ 0011111100111111101001011011001000111111001111111010010010110011001111110011111110100100101000100111001001011011001111110011111110100101101100100011111100111111101001001011001100111111001111111010010010100010011100100101101101011110 3f3fa5b23f3fa4b33f3fa4a2725b3f3fa5b23f3fa4b33f3fa4a2725b5e
UTF-8 룶찋ゲ룵퓦こ룵쥚あr[룶찋ゲ룵퓦こ룵쥚あr[^ 1110101110100011101101101110110010110000100010111110001110000010101100101110101110100011101101011110110110010011101001101110001110000001100100111110101110100011101101011110110010100101100110101110001110000001100000100111001001011011111010111010001110110110111011001011000010001011111000111000001010110010111010111010001110110101111011011001001110100110111000111000000110010011111010111010001110110101111011001010010110011010111000111000000110000010011100100101101101011110 eba3b6ecb08be382b2eba3b5ed93a6e38193eba3b5eca59ae38182725beba3b6ecb08be382b2eba3b5ed93a6e38193eba3b5eca59ae38182725b5e
UHC 룶찋ゲ룵퓦こ룵쥚あr[룶찋ゲ룵퓦こ룵쥚あr[^ 1000111110101011101010011000111110101011101100101000111110101010101111111000111110101010101100111000111110101010101000101000111110101010101000100111001001011011100011111010101110101001100011111010101110110010100011111010101010111111100011111010101010110011100011111010101010100010100011111010101010100010011100100101101101011110 8faba98fabb28faabf8faab38faaa28faaa2725b8faba98fabb28faabf8faab38faaa28faaa2725b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
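
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the helper name `encode_to_bits` is hypothetical, and the codec names are assumptions about how the charset labels map onto Python's codecs (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters
    data = text.encode(charset, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


# Assumed mapping from the table's charset labels to Python codec names
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    bits, hex_string = encode_to_bits("あ", codec)
    print(label, bits, hex_string)
```

For the single character あ, this yields the same byte values that appear for it in the table rows for SJIS-WIN (82a0), EUC-JP (a4a2), and UTF-8 (e38182).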