To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??揖???〓?俑??揖???〓?癲 1000100101101001001111110011111110010111010010110011111100111111001111111000000110101100001111111001100011011010001111110011111110010111010010110011111100111111001111111000000110101100001111111110000110011111 89693f3f974b3f3f3f81ac3f98da3f3f974b3f3f3f81ac3fe19f
EUC-JP 永??揖??洹〓?俑??揖??洹〓?癲 101100011100101000111111001111111100110110101100001111110011111110001111110001111011101010100010101011100011111111010000110111000011111100111111110011011010110000111111001111111000111111000111101110101010001010101110001111111110001010100001 b1ca3f3fcdac3f3f8fc7baa2ae3fd0dc3f3fcdac3f3f8fc7baa2ae3fe2a1
UTF-8 永띔퍜揖섊독洹〓궖俑앹떣揖섊독洹〓궖癲 111001101011000010111000111010111001110110010100111011011000110110011100111001101000111110010110111011001000010010001010111010111000111110000101111001101011010010111001111000111000000010010011111010101011011010010110111001001011111110010001111011001001010110111001111010111001011010100011111001101000111110010110111011001000010010001010111010111000111110000101111001101011010010111001111000111000000010010011111010101011011010010110111001111001100110110010 e6b0b8eb9d94ed8d9ce68f96ec848aeb8f85e6b4b9e38093eab696e4bf91ec95b9eb96a3e68f96ec848aeb8f85e6b4b9e38093eab696e799b2
UHC 永띔퍜揖섊독洹〓궖俑앹떣揖섊독洹〓궖癲 1110011110110101101101101110101010111011100100111110101111100111100110001110011110110101101101101110101010110111101000011110101110000010101010111110100110110101100111011110110010001011101101111110101111100111100110001110011110110101101101101110101010110111101000011110101110000010101010111110111110100110 e7b5b6eabb93ebe798e7b5b6eab7a1eb82abe9b59dec8bb7ebe798e7b5b6eab7a1eb82abefa6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)