To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 曄?????縊h、[曄?????縊h、[^ 1001111001000000001111110011111100111111001111110011111111100011011011111000001010001000100000010100000101011011100111100100000000111111001111110011111100111111001111111110001101101111100000101000100010000001010000010101101101011110 9e403f3f3f3f3fe36f828881415b9e403f3f3f3f3fe36f828881415b5e
EUC-JP 曄?????縊h、[曄?????縊h、[^ 1101101110100001001111110011111100111111001111110011111111100101110100001010001111101000101000011010001001011011110110111010000100111111001111110011111100111111001111111110010111010000101000111110100010100001101000100101101101011110 dba13f3f3f3f3fe5d0a3e8a1a25bdba13f3f3f3f3fe5d0a3e8a1a25b5e
UTF-8 曄됯퀋溜뤿젙縊h、[曄됯퀋溜뤿젙縊h、[^ 111001101001101110000100111010111001000010101111111011011000000010001011111011111010011110001011111010111010010010111111111011001010000010011001111001111011100010001010111011111011110110001000111000111000000010000001010110111110011010011011100001001110101110010000101011111110110110000000100010111110111110100111100010111110101110100100101111111110110010100000100110011110011110111000100010101110111110111101100010001110001110000000100000010101101101011110 e69b84eb90afed808befa78beba4bfeca099e7b88aefbd88e380815be69b84eb90afed808befa78beba4bfeca099e7b88aefbd88e380815b5e
UHC 曄됯퀋溜뤿젙縊h、[曄됯퀋溜뤿젙縊h、[^ 111001111010010110001001111010101011001110000001111010101111111010001111111010111010000010010101111001001111110010100011111010001010000110100010010110111110011110100101100010011110101010110011100000011110101011111110100011111110101110100000100101011110010011111100101000111110100010100001101000100101101101011110 e7a589eab381eafe8feba095e4fca3e8a1a25be7a589eab381eafe8feba095e4fca3e8a1a25b5e

SJIS-Win, EUC-JP: classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
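
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) because of errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary, hexadecimal.
    for label, (bits, hexstr) in encode_table("、[^").items():
        print(label, bits, hexstr)
```

Comparing the hexadecimal column across charsets shows that ASCII characters such as `[` and `^` keep the same single byte (`5b`, `5e`) in all five, while the ideographic comma `、` takes two bytes in SJIS-WIN, EUC-JP and UHC and three bytes in UTF-8.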