What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 閼ア霎ー貂ャ譽壼ュ俶純迢ク蛛エ蟆願┳螻槫ュ 1110100010000100101100011110100010111110101100001110011010111000101011001110011010100011100110101110010110101101100110001110011010001111100000111110011110001011101110001110010110000001101101001110010110110000100010101110100010000100101100011110010110110001100111101110010110101101 e884b1e8beb0e6b8ace6a39ae5ad98e68f83e78bb8e581b4e5b08ae884b1e5b19ee5ad
EUC-JP 閼ア霎ー貂ャ譽壼ュ俶純迢ク蛛エ蟆願┳螻槫ュ 111011111110010010001110101100011111000011000000100011101011000011101100101110101000111010101100111011001010010111010100111001111000111010101101110100001110100010111101111000111110110111101011100011101011100011101001111000011000111010110100111010101011001010110100111010101010100010110011111010101011001111011100111001111000111010101101 efe48eb1f0c08eb0ecba8eaceca5d4e78eadd0e8bde3edeb8eb8e9e18eb4eab2b4eaa8b3eab3dce78ead
UTF-8 閼ア霎ー貂ャ譽壼ュ俶純迢ク蛛エ蟆願┳螻槫ュ 111010011001011010111100111011111011110110110001111010011001110010001110111011111011110110110000111010001011001010000010111011111011110110101100111010001010110110111101111001011010001110111100111011111011110110101101111001001011111110110110111001111011010010010100111010001011111110100010111011111011110110111000111010001001101110011011111011111011110110110100111010001001111110000110111010011010000110011000111000101001010010110011111010001001111010111011111001101010011110101011111011111011110110101101 e996bcefbdb1e99c8eefbdb0e8b282efbdace8adbde5a3bcefbdade4bfb6e7b494e8bfa2efbdb8e89b9befbdb4e89f86e9a198e294b3e89ebbe6a7abefbdad
UHC 閼???貂?譽???純??蛛??願┳??? 11100100110110010011111100111111001111111111010110110000001111111110011111100010001111110011111100111111111000101110110100111111001111111111000111001000001111110011111111101010110000111010011010110011001111110011111100111111 e4d93f3f3ff5b03fe7e23f3f3fe2ed3f3ff1c83f3feac3a6b33f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
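
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the ISO-8859-1, SJIS-Win, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced by `?` (byte `3f`), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical, and the codec tables may differ from this page's converter in edge cases.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: these codecs match the charsets the page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.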