What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????{N}????????{N{^ 0011111100111111001111110011111100111111001111110011111100111111011110110100111001111101001111110011111100111111001111110011111100111111001111110011111101111011010011100111101101011110 3f3f3f3f3f3f3f3f7b4e7d3f3f3f3f3f3f3f3f7b4e7b5e
SJIS-WIN 偲叱偲トナワナ痔{N}偲叱偲トナワナ痔{N{^ 10001110110000111000111010110110100011101100001111000100110001011101110011000101100011101010010001111011010011100111110110001110110000111000111010110110100011101100001111000100110001011101110011000101100011101010010001111011010011100111101101011110 8ec38eb68ec3c4c5dcc58ea47b4e7d8ec38eb68ec3c4c5dcc58ea47b4e7b5e
EUC-JP 偲叱偲トナワナ痔{N}偲叱偲トナワナ痔{N{^ 101111001100010110111100101110001011110011000101100011101100010010001110110001011000111011011100100011101100010110111100101001100111101101001110011111011011110011000101101111001011100010111100110001011000111011000100100011101100010110001110110111001000111011000101101111001010011001111011010011100111101101011110 bcc5bcb8bcc58ec48ec58edc8ec5bca67b4e7dbcc5bcb8bcc58ec48ec58edc8ec5bca67b4e7b5e
UTF-8 偲叱偲トナワナ痔{N}偲叱偲トナワナ痔{N{^ 11100101100000011011001011100101100011111011000111100101100000011011001011101111101111101000010011101111101111101000010111101111101111101001110011101111101111101000010111100111100101111001010001111011010011100111110111100101100000011011001011100101100011111011000111100101100000011011001011101111101111101000010011101111101111101000010111101111101111101001110011101111101111101000010111100111100101111001010001111011010011100111101101011110 e581b2e58fb1e581b2efbe84efbe85efbe9cefbe85e797947b4e7de581b2e58fb1e581b2efbe84efbe85efbe9cefbe85e797947b4e7b5e
UHC ?叱?????痔{N}?叱?????痔{N{^ 001111111111001011101010001111110011111100111111001111110011111111110110110000000111101101001110011111010011111111110010111010100011111100111111001111110011111100111111111101101100000001111011010011100111101101011110 3ff2ea3f3f3f3f3ff6c07b4e7d3ff2ea3f3f3f3f3ff6c07b4e7b5e
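The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes that Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets ISO-8859-1, SJIS-win, EUC-JP, UTF-8, and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table rows suggest. The names `to_bit_strings` and `CHARSETS` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "痔{N}"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is always printed as eight bits, so the binary column is exactly four times as long as the hexadecimal column.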

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).