What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???鎰?┸儀??}???鎰?┸儀??{^ 001111110011111100111111111010000100110000111111100001001011110110001011010101100011111100111111011111010011111100111111001111111110100001001100001111111000010010111101100010110101011000111111001111110111101101011110 3f3f3fe84c3f84bd8b563f3f7d3f3f3fe84c3f84bd8b563f3f7b5e
EUC-JP ???鎰?┸儀??}???鎰?┸儀??{^ 001111110011111100111111111011111010110100111111101010001011111110110101101101110011111100111111011111010011111100111111001111111110111110101101001111111010100010111111101101011011011100111111001111110111101101011110 3f3f3fefad3fa8bfb5b73f3f7d3f3f3fefad3fa8bfb5b73f3f7b5e
UTF-8 娛붿룞鎰먲┸儀숈뒛}娛붿룞鎰먲┸儀숈뒛{^ 111001011010100010011011111010111011011010111111111010111010001110011110111010011000111010110000111010111010100010110010111000101001010010111000111001011000010010000000111011001000100010001000111010111001001010011011011111011110010110101000100110111110101110110110101111111110101110100011100111101110100110001110101100001110101110101000101100101110001010010100101110001110010110000100100000001110110010001000100010001110101110010010100110110111101101011110 e5a89bebb6bfeba39ee98eb0eba8b2e294b8e58480ec8888eb929b7de5a89bebb6bfeba39ee98eb0eba8b2e294b8e58480ec8888eb929b7b5e
UHC 娛붿룞鎰먲┸儀숈뒛}娛붿룞鎰먲┸儀숈뒛{^ 111001111111010010010100111011001000111110011001111011001111000010010000111011111010011010111111111010111111000010011001111011001000101010011000011111011110011111110100100101001110110010001111100110011110110011110000100100001110111110100110101111111110101111110000100110011110110010001010100110000111101101011110 e7f494ec8f99ecf090efa6bfebf099ec8a987de7f494ec8f99ecf090efa6bfebf099ec8a987b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
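
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the tool's actual implementation: the helper name `encode_row` and the `CHARSETS` mapping are hypothetical, the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed equivalents of the table labels, and substituting `?` (0x3f) for unencodable characters is an assumption based on the `3f` bytes visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec.

    Characters the codec cannot represent are replaced with "?" (0x3f),
    which is assumed to mirror what the table shows.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ{^"  # hypothetical input, not taken from the page
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

ASCII characters such as `{`, `}` and `^` encode to the same single byte in all five charsets, which is why every row of the table ends in `7b5e`.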