To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????n}??????????n{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 倭??源?倭??源?n}倭??源?倭??源?n{^ 100110000110000000111111001111111000110010111001001111111001100001100000001111110011111110001100101110010011111101101110011111011001100001100000001111110011111110001100101110010011111110011000011000000011111100111111100011001011100100111111011011100111101101011110 98603f3f8cb93f98603f3f8cb93f6e7d98603f3f8cb93f98603f3f8cb93f6e7b5e
EUC-JP 倭??源?倭??源?n}倭??源?倭??源?n{^ 110011111100000100111111001111111011100010111011001111111100111111000001001111110011111110111000101110110011111101101110011111011100111111000001001111110011111110111000101110110011111111001111110000010011111100111111101110001011101100111111011011100111101101011110 cfc13f3fb8bb3fcfc13f3fb8bb3f6e7dcfc13f3fb8bb3fcfc13f3fb8bb3f6e7b5e
UTF-8 倭녾난源멵倭녾난源멞n}倭녾난源멵倭녾난源멞n{^ 1110010110000000101011011110101110000101101111101110101110000010100111001110011010111010100100001110101110101001101101011110010110000000101011011110101110000101101111101110101110000010100111001110011010111010100100001110101110101001100111100110111001111101111001011000000010101101111010111000010110111110111010111000001010011100111001101011101010010000111010111010100110110101111001011000000010101101111010111000010110111110111010111000001010011100111001101011101010010000111010111010100110011110011011100111101101011110 e580adeb85beeb829ce6ba90eba9b5e580adeb85beeb829ce6ba90eba99e6e7de580adeb85beeb829ce6ba90eba9b5e580adeb85beeb829ce6ba90eba99e6e7b5e
UHC 倭녾난源멵倭녾난源멞n}倭녾난源멵倭녾난源멞n{^ 111010001101111010000110111010101011001110101101111010101011100110010001011000111110100011011110100001101110101010110011101011011110101010111001100100010100111001101110011111011110100011011110100001101110101010110011101011011110101010111001100100010110001111101000110111101000011011101010101100111010110111101010101110011001000101001110011011100111101101011110 e8de86eab3adeab99163e8de86eab3adeab9914e6e7de8de86eab3adeab99163e8de86eab3adeab9914e6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)