To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歟??揖??幽??歟 1001111101100010001111110011111110010111010010110011111100111111100101110100100000111111001111111001111101100010 9f623f3f974b3f3f97483f3f9f62
EUC-JP 歟??揖??幽??歟 1101110111000011001111110011111111001101101011000011111100111111110011011010100100111111001111111101110111000011 ddc33f3fcdac3f3fcda93f3fddc3
UTF-8 歟㏐랬揖졿뿿幽뚯뒓歟 111001101010110110011111111000111000111110010000111010111001111010101100111001101000111110010110111011001010000110111111111010111011111110111111111001011011100110111101111010111001101010101111111010111001001010010011111001101010110110011111 e6ad9fe38f90eb9eace68f96eca1bfebbfbfe5b9bdeb9aafeb9293e6ad9f
UHC 歟㏐랬揖졿뿿幽뚯뒓歟 1110011010100010101001111110101010110111101010001110101111100111101000001110011010010111101111111110101011101011100011001110110010001010100100001110011010100010 e6a2a7eab7a8ebe7a0e697bfeaeb8cec8a90e6a2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)