To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 媛私?怨?箏?正?媛私?怨?箏?正?^ 1001010101010001100011101000010000111111100010011000010100111111111000101011010100111111100100001011001100111111100101010101000110001110100001000011111110001001100001010011111111100010101101010011111110010000101100110011111101011110 95518e843f89853fe2b53f90b33f95518e843f89853fe2b53f90b33f5e
EUC-JP 媛私?怨?箏?正?媛私?怨?箏?正?^ 1100100110110010101110111110010000111111101100011110010100111111111001001011011100111111110000001011010100111111110010011011001010111011111001000011111110110001111001010011111111100100101101110011111111000000101101010011111101011110 c9b2bbe43fb1e53fe4b73fc0b53fc9b2bbe43fb1e53fe4b73fc0b53f5e
UTF-8 媛私떼怨렊箏렫正렢媛私떼怨렊箏렫正렋^ 11100101101010101001101111100111101001111000000111101011100101101011110011100110100000001010100011101011101000001000101011100111101011101000111111101011101000001010101111100110101011011010001111101011101000001010001011100101101010101001101111100111101001111000000111101011100101101011110011100110100000001010100011101011101000001000101011100111101011101000111111101011101000001010101111100110101011011010001111101011101000001000101101011110 e5aa9be7a781eb96bce680a8eba08ae7ae8feba0abe6ada3eba0a2e5aa9be7a781eb96bce680a8eba08ae7ae8feba0abe6ada3eba08b5e
UHC 媛私떼怨렊箏렫正렢媛私떼怨렊箏렫正렋^ 11101010101100001101111011100111101101101011110011101010101100111000111010100001111011101011010010001110101110011110111111100001100011101011001111101010101100001101111011100111101101101011110011101010101100111000111010100001111011101011010010001110101110011110111111100001100011101010001001011110 eab0dee7b6bceab38ea1eeb48eb9efe18eb3eab0dee7b6bceab38ea1eeb48eb9efe18ea25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)