What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????n}??????????n{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 倭??哀?倭??哀?n}倭??哀?倭??哀?n{^ 100110000110000000111111001111111000100010100011001111111001100001100000001111110011111110001000101000110011111101101110011111011001100001100000001111110011111110001000101000110011111110011000011000000011111100111111100010001010001100111111011011100111101101011110 98603f3f88a33f98603f3f88a33f6e7d98603f3f88a33f98603f3f88a33f6e7b5e
EUC-JP 倭??哀?倭??哀?n}倭??哀?倭??哀?n{^ 110011111100000100111111001111111011000010100101001111111100111111000001001111110011111110110000101001010011111101101110011111011100111111000001001111110011111110110000101001010011111111001111110000010011111100111111101100001010010100111111011011100111101101011110 cfc13f3fb0a53fcfc13f3fb0a53f6e7dcfc13f3fb0a53fcfc13f3fb0a53f6e7b5e
UTF-8 倭끸닽哀퉉倭끸닽哀툷n}倭끸닽哀퉉倭끸닽哀툷n{^ 1110010110000000101011011110101110000001101110001110101110001011101111011110010110010011100000001110110110001001100010011110010110000000101011011110101110000001101110001110101110001011101111011110010110010011100000001110110110001000101101110110111001111101111001011000000010101101111010111000000110111000111010111000101110111101111001011001001110000000111011011000100110001001111001011000000010101101111010111000000110111000111010111000101110111101111001011001001110000000111011011000100010110111011011100111101101011110 e580adeb81b8eb8bbde59380ed8989e580adeb81b8eb8bbde59380ed88b76e7de580adeb81b8eb8bbde59380ed8989e580adeb81b8eb8bbde59380ed88b76e7b5e
UHC 倭끸닽哀퉉倭끸닽哀툷n}倭끸닽哀퉉倭끸닽哀툷n{^ 111010001101111010000101111000101000100010101011111001001110111010111001010101111110100011011110100001011110001010001000101010111110010011101110101110010100101001101110011111011110100011011110100001011110001010001000101010111110010011101110101110010101011111101000110111101000010111100010100010001010101111100100111011101011100101001010011011100111101101011110 e8de85e288abe4eeb957e8de85e288abe4eeb94a6e7de8de85e288abe4eeb957e8de85e288abe4eeb94a6e7b5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
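
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the tool's actual implementation: the helper name `encode_row` and the `CHARSETS` mapping are made up for illustration, the Python codec names (`cp932` for SJIS-win, `cp949` for UHC, and so on) are assumed to correspond to the charsets above, and replacing unencodable characters with "?" (0x3f) is an assumption based on the `3f` bytes visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "倭n{^"  # a short sample taken from the table above
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row(sample, codec)
        print(label, bits, hexed)
```

For example, `encode_row("倭", "utf-8")` yields the hexadecimal string `e580ad`, which matches the first three bytes of the UTF-8 row in the table.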