To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏????ⅳ異??癲?????億 100010010100011100111111001111110011111100111111111110100100001110001000110110010011111100111111111000011001111100111111001111110011111100111111001111111000100110101101 89473f3f3f3ffa4388d93f3fe19f3f3f3f3f3f89ad
EUC-JP 烏?????異??癲?????億 1011000110101000001111110011111100111111001111110011111110110000110110110011111100111111111000101010000100111111001111110011111100111111001111111011001010101111 b1a83f3f3f3f3fb0db3f3fe2a13f3f3f3f3fb2af
UTF-8 烏숅쪓琉억ⅳ異뺟쾬癲딃쪓琉삼쫽億 111001111000001110001111111011001000100010000101111011001010101010010011111011111010011110001100111011001001011010110101111000101000010110110011111001111001010110110000111010111011101010011111111011001011111010101100111001111001100110110010111010111001010010000011111011001010101010010011111011111010011110001100111011001000001010111100111011001010101110111101111001011000010010000100 e7838fec8885ecaa93efa78cec96b5e285b3e795b0ebba9fecbeace799b2eb9483ecaa93efa78cec82bcecabbde58484
UHC 烏숅쪓琉억ⅳ異뺟쾬癲딃쪓琉삼쫽億 1110100010100001100110011110100110100101100011011110101110100100101111101110111110100101101001001110110010110110100101011110011110110010100000111110111110100110100010101110100110100101100011011110101110100100101110111110111110100110100101001110010111100010 e8a199e9a58deba4beefa5a4ecb695e7b283efa68ae9a58deba4bbefa694e5e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)