To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻る?疑??濡??筌??違э?臾??壓 11100100111010001000001011101001001111111000101101011110001111110011111110010100010001110011111100111111111000101010001100111111001111111000100011100001100001001000111100111111111001000110101100111111001111111001101011011000 e4e882e93f8b5e3f3f94473f3fe2a33f3f88e1848f3fe46b3f3f9ad8
EUC-JP 蒻る?疑??濡??筌??違э?臾??壓 11101000111010101010010011101011001111111011010110111111001111110011111111000111101010000011111100111111111001001010010100111111001111111011000011100011101001111110111100111111111001111100110000111111001111111101010011011010 e8eaa4eb3fb5bf3f3fc7a83f3fe4a53f3fb0e3a7ef3fe7cc3f3fd4da
UTF-8 蒻る쪇疑뗰쭓濡곕젗筌뗫씢違э쭓臾먯젌壓 1110100010010010101110111110001110000010100010111110110010101010100001111110011110010110100100011110101110010111101100001110110010101101100100111110011010111111101000011110101010110011100101011110110010100000100101111110011110101101100011001110101110010111101010111110110010010100101000101110100110000001100101011101000110001101111011001010110110010011111010001000011110111110111010111010100010101111111011001010000010001100111001011010001110010011 e892bbe3828becaa87e79691eb97b0ecad93e6bfa1eab395eca097e7ad8ceb97abec94a2e98195d18decad93e887beeba8afeca08ce5a393
UHC 蒻る쪇疑뗰쭓濡곕젗筌뗫씢違э쭓臾먯젌壓 1110010110110110101010101110101110100101100000011110101111110111100010111110111110100111100010111110101110100001101100001110101110100000100100111110111110100111100010111110101110011101101101101110101011011110101011001110111110100111100010111110101110101100100100001110110010100000100011011110010011100010 e5b6aaeba581ebf78befa78beba1b0eba093efa78beb9db6eadeacefa78bebac90eca08de4e2

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
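
The conversion shown in the table can be approximated offline. Below is a minimal sketch in Python, not the page's actual implementation. The function name `encode_all` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to be what the table does as well.

```python
# Charset labels from the table, mapped to Python codec names.
# This mapping is an assumption: cp932 for SJIS-WIN, cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return a list of (label, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_all("あ"):
        print(label, bits, hex_string)
```

Calling `encode_all` with a different string produces one row per charset in the same order as the table, so the hexadecimal column can be compared directly against the values above.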