To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 弔?制絅?棒??趙貊?企??怨峯?棒??趙貊?發h 1001001010100010001111111001000010100111111000110100010000111111100101100101111100111111001111111110011011100010111001101011101100111111100010101110100100111111001111111000100110000101100101011111010100111111100101100101111100111111001111111110011011100010111001101011101100111111111000011010001001101000 92a23f90a7e3443f965f3f3fe6e2e6bb3f8ae93f3f898595f53f965f3f3fe6e2e6bb3fe1a268
EUC-JP 弔?制絅?棒??趙貊?企??怨峯?棒??趙貊?發h 1100010010100100001111111100000010101001111001011010010100111111110010111100000000111111001111111110110011100100111011001011110100111111101101001110101100111111001111111011000111100101110010101111011100111111110010111100000000111111001111111110110011100100111011001011110100111111111000101010010001101000 c4a43fc0a9e5a53fcbc03f3fece4ecbd3fb4eb3f3fb1e5caf73fcbc03f3fece4ecbd3fe2a468
UTF-8 弔렲制絅렠棒렕렟趙貊렚企렱렲怨峯렠棒렕렟趙貊렠發h 11100101101111001001010011101011101000001011001011100101100010001011011011100111101101011000010111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111101000101101101001100111101000101100101000101011101011101000001001101011100100101111001000000111101011101000001011000111101011101000001011001011100110100000001010100011100101101100111010111111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111101000101101101001100111101000101100101000101011101011101000001010000011100111100110011011110001101000 e5bc94eba0b2e588b6e7b585eba0a0e6a392eba095eba09fe8b699e8b28aeba09ae4bc81eba0b1eba0b2e680a8e5b3afeba0a0e6a392eba095eba09fe8b699e8b28aeba0a0e799bc68
UHC 弔렲制絅렠棒렕렟趙貊렚企렱렲怨峯렠棒렕렟趙貊렠發h 11110000110000001000111010111111111100001010010011001100111001111000111010110001110111001110101010001110101010101000111010110000111100001110000111011000111001111000111010101101110100001110101010001110101111101000111010111111111010101011001111011100111001111000111010110001110111001110101010001110101010101000111010110000111100001110000111011000111001111000111010110001110110111010000101101000 f0c08ebff0a4cce78eb1dcea8eaa8eb0f0e1d8e78eadd0ea8ebe8ebfeab3dce78eb1dcea8eaa8eb0f0e1d8e78eb1dba168

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)