To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣③?碎??畏?????矣??耀??? 00111111001111110011111110001011100000111000011101000010001111111110000111101010001111110011111110001000110110000011111100111111001111110011111100111111111000011110000100111111001111111001011101110011001111110011111100111111 3f3f3f8b8387423fe1ea3f3f88d83f3f3f3f3fe1e13f3f97733f3f3f
EUC-JP 濚??泣??碎??畏?????矣??耀??? 1000111111001001101000010011111100111111101101011110001100111111001111111110001011101100001111110011111110110000110110100011111100111111001111110011111100111111111000101110001100111111001111111100110111010100001111110011111100111111 8fc9a13f3fb5e33f3fe2ec3f3fb0da3f3f3f3f3fe2e33f3fcdd43f3f3f
UTF-8 濚뱀슱泣③쪙碎좊쳩畏븐옊六쀧뭐矣몄뒯耀믠몺紐 111001101011111110011010111010111011000110000000111011001000101010110001111001101011001110100011111000101001000110100010111011001010101010011001111001111010001010001110111011001010001010001010111011001011001110101001111001111001010110001111111010111011100010010000111011001001100010001010111011111010011110010001111011001000000010100111111010111010110110010000111001111001111110100011111010111010101010000100111010111001001010101111111010001000000010000000111010111010111110100000111010111010101010111010111011111010011110001111 e6bf9aebb180ec8ab1e6b3a3e291a2ecaa99e7a28eeca28aecb3a9e7958febb890ec988aefa791ec80a7ebad90e79fa3ebaa84eb92afe88080ebafa0ebaabaefa78f
UHC 濚뱀슱泣③쪙碎좊쳩畏븐옊六쀧뭐矣몄뒯耀믠몺紐 1110011110111001101110011110110010011010101110001110101111101000101010001110100110100101100100101110000111101111101000001110101110101011100011101110100011100110101110101110110010011110100100101110101110111011100101111110011110111001101110011110101111111000101110001110110010001010101010001110100110100101100100101110001010010001101000001110101110101010 e7b9b9ec9ab8ebe8a8e9a592e1efa0ebab8ee8e6baec9e92ebbb97e7b9b9ebf8b8ec8aa8e9a592e291a0ebaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)