What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??純?????鸚??筍??碎μ?筌??揖 111000101010001100111111001111111000111110000011001111110011111100111111001111110011111111101010010111110011111100111111111000101010000100111111001111111110000111101010100000111100101000111111111000101010001100111111001111111001011101001011 e2a33f3f8f833f3f3f3f3fea5f3f3fe2a13f3fe1ea83ca3fe2a33f3f974b
EUC-JP 筌??純??洧??鸚??筍??碎μ?筌??揖 1110010010100101001111110011111110111101111000110011111100111111100011111100011110110100001111110011111111110011110000000011111100111111111001001010001100111111001111111110001011101100101001101100110000111111111001001010010100111111001111111100110110101100 e4a53f3fbde33f3f8fc7b43f3ff3c03f3fe4a33f3fe2eca6cc3fe4a53f3fcdac
UTF-8 筌㏂끉純뤷ㅇ洧우퐧鸚룸뜆筍앾㎖碎μ퐣筌듭쉹揖 1110011110101101100011001110001110001111100000101110101110000001100010011110011110110100100101001110101110100100101101111110001110000101100001111110011010110100101001111110110010011010101100001110110110010000101001111110100110111000100110101110101110100011101110001110101110011100100001101110011110101101100011011110110010010101101111101110001110001110100101101110011110100010100011101100111010111100111011011001000010100011111001111010110110001100111010111001001110101101111011001000100110111001111001101000111110010110 e7ad8ce38f82eb8189e7b494eba4b7e38587e6b4a7ec9ab0ed90a7e9b89aeba3b8eb9c86e7ad8dec95bee38e96e7a28ecebced90a3e7ad8ceb93adec89b9e68f96
UHC 筌㏂끉純뤷ㅇ洧우퐧鸚룸뜆筍앾㎖碎μ퐣筌듭쉹揖 1110111110100111101000101110001110000101101111001110001011101101100011111110010110100100101101111110101011111011101111111110110010111101100100001110010110100100101101111110101110001101100010011110001011101100100111011110111110100111101000101110000111101111101001011110110010111101100011001110111110100111101101011110110010011010100011111110101111100111 efa7a2e385bce2ed8fe5a4b7eafbbfecbd90e5a4b7eb8d89e2ec9defa7a2e1efa5ecbd8cefa7b5ec9a8febe7

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
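
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?", which is one plausible explanation for the runs of 3f in the table.

```python
# Minimal sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names correspond to the table's labels.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("あ").items():
        print(name, bits, hex_string)
```

Using `errors="replace"` rather than the default strict mode keeps the sketch from raising an exception when the input falls outside a charset's repertoire, so every row still produces output.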