To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 乳℡?哀嶝??壓????℡?哀 10010011111110111000011110000100001111111000100010100011100110111101000100111111001111111001101011011000001111110011111100111111001111111000011110000100001111111000100010100011 93fb87843f88a39bd13f3f9ad83f3f3f3f87843f88a3
EUC-JP 乳??哀嶝??壓??????哀 1100011011111101001111110011111110110000101001011101011011010011001111110011111111010100110110100011111100111111001111110011111100111111001111111011000010100101 c6fd3f3fb0a5d6d33f3fd4da3f3f3f3f3f3fb0a5
UTF-8 乳℡썬哀嶝렒렦壓몄렮며롛℡썬哀 111001001011100110110011111000101000010010100001111011001000110110101100111001011001001110000000111001011011011010011101111010111010000010010010111010111010000010100110111001011010001110010011111010111010101010000100111010111010000010101110111010111010100110110000111010111010000110011011111000101000010010100001111011001000110110101100111001011001001110000000 e4b9b3e284a1ec8dace59380e5b69deba092eba0a6e5a393ebaa84eba0aeeba9b0eba19be284a1ec8dace59380
UHC 乳℡썬哀嶝렒렦壓몄렮며롛℡썬哀 111010101110000110100010111001011011110111100011111001001110111011010100111100011000111010100111100011101011010111100100111000101011100011101100100011101011101110111000111001111000111011011111101000101110010110111101111000111110010011101110 eae1a2e5bde3e4eed4f18ea78eb5e4e2b8ec8ebbb8e78edfa2e5bde3e4ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)