To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 曜????????Lh曜????????L 1001011101101010001111110011111100111111001111110011111100111111001111110011111101001100011010001001011101101010001111110011111100111111001111110011111100111111001111110011111101001100 976a3f3f3f3f3f3f3f3f4c68976a3f3f3f3f3f3f3f3f4c
EUC-JP 曜??濚??濚??Lh曜??濚??濚??L 11001101110010110011111100111111100011111100100110100001001111110011111110001111110010011010000100111111001111110100110001101000110011011100101100111111001111111000111111001001101000010011111100111111100011111100100110100001001111110011111101001100 cdcb3f3f8fc9a13f3f8fc9a13f3f4c68cdcb3f3f8fc9a13f3f8fc9a13f3f4c
UTF-8 曜랃쉈濚앾쉑濚㏆스Lh曜랃쉈濚앾쉑濚㏆스L 111001101001101110011100111010111001111010000011111011001000100110001000111001101011111110011010111011001001010110111110111011001000100110010001111001101011111110011010111000111000111110000110111011001000101010100100010011000110100011100110100110111001110011101011100111101000001111101100100010011000100011100110101111111001101011101100100101011011111011101100100010011001000111100110101111111001101011100011100011111000011011101100100010101010010001001100 e69b9ceb9e83ec8988e6bf9aec95beec8991e6bf9ae38f86ec8aa44c68e69b9ceb9e83ec8988e6bf9aec95beec8991e6bf9ae38f86ec8aa44c
UHC 曜랃쉈濚앾쉑濚㏆스Lh曜랃쉈濚앾쉑濚㏆스L 111010001111100010001101111011111011110110100101111001111011100110011101111011111011110110100111111001111011100110100111111011111011110110111010010011000110100011101000111110001000110111101111101111011010010111100111101110011001110111101111101111011010011111100111101110011010011111101111101111011011101001001100 e8f88defbda5e7b99defbda7e7b9a7efbdba4c68e8f88defbda5e7b99defbda7e7b9a7efbdba4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)