To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 燾フ醉譟鉎自煆 11111011010110101100110011100111110010111110011010011111111110111100011110001110101010011111101101010110 fb5acce7cbe69ffbc78ea9fb56
EUC-JP 燾フ醉譟鉎自煆 1000111111001010101111011000111011001100111011101100110111101100101000011000111111100011110111111011110010101011100011111100100111110100 8fcabd8ecceecdeca18fe3dfbcab8fc9f4
UTF-8 燾フ醉譟鉎自煆 111001111000011110111110111011111011111010001100111010011000011010001001111010001010110110011111111010011000100110001110111010001000011110101010111001111000010110000110 e787beefbe8ce98689e8ad9fe9898ee887aae78586
UHC 燾?醉??自? 11010100101001110011111111110110101011010011111100111111111011011011101100111111 d4a73ff6ad3f3fedbb3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)