To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??踰?????? 100101101110100100111111001111111110011011111010001111110011111100111111001111110011111100111111 96e93f3fe6fa3f3f3f3f3f3f
EUC-JP 夜??踰??洧??? 1100110011101011001111110011111111101100111111000011111100111111100011111100011110110100001111110011111100111111 cceb3f3fecfc3f3f8fc7b43f3f3f
UTF-8 夜쎼굤踰㎪꼮洧얜턂略 111001011010010010011100111011001000111010111100111010101011010110100100111010001011100010110000111000111000111010101010111010101011110010101110111001101011010010100111111011001001011010011100111011011000010010000010111011111010010110110110 e5a49cec8ebceab5a4e8b8b0e38eaaeabcaee6b4a7ec969ced8482efa5b6
UHC 夜쎼굤踰㎪꼮洧얜턂略 1110010110101000100110111110001110000010100010101110101110110010101001111110011010000100100010011110101011111011101111101110101110110101100111101110010110110010 e5a89be3828aebb2a7e68489eafbbeebb59ee5b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)