Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 弔?制絅?棒??畯貊?企??怨峯?棒??畯貊?發h 1001001010100010001111111001000010100111111000110100010000111111100101100101111100111111001111111111101101101111111001101011101100111111100010101110100100111111001111111000100110000101100101011111010100111111100101100101111100111111001111111111101101101111111001101011101100111111111000011010001001101000 92a23f90a7e3443f965f3f3ffb6fe6bb3f8ae93f3f898595f53f965f3f3ffb6fe6bb3fe1a268
EUC-JP 弔?制絅?棒??畯貊?企??怨峯?棒??畯貊?發h 11000100101001000011111111000000101010011110010110100101001111111100101111000000001111110011111110001111110011011011101111101100101111010011111110110100111010110011111100111111101100011110010111001010111101110011111111001011110000000011111100111111100011111100110110111011111011001011110100111111111000101010010001101000 c4a43fc0a9e5a53fcbc03f3f8fcdbbecbd3fb4eb3f3fb1e5caf73fcbc03f3f8fcdbbecbd3fe2a468
UTF-8 弔렲制絅렠棒렕렟畯貊렚企렱렲怨峯렠棒렕렟畯貊렠發h 11100101101111001001010011101011101000001011001011100101100010001011011011100111101101011000010111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111100111100101011010111111101000101100101000101011101011101000001001101011100100101111001000000111101011101000001011000111101011101000001011001011100110100000001010100011100101101100111010111111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111100111100101011010111111101000101100101000101011101011101000001010000011100111100110011011110001101000 e5bc94eba0b2e588b6e7b585eba0a0e6a392eba095eba09fe795afe8b28aeba09ae4bc81eba0b1eba0b2e680a8e5b3afeba0a0e6a392eba095eba09fe795afe8b28aeba0a0e799bc68
UHC 弔렲制絅렠棒렕렟畯貊렚企렱렲怨峯렠棒렕렟畯貊렠發h 11110000110000001000111010111111111100001010010011001100111001111000111010110001110111001110101010001110101010101000111010110000111100011110000111011000111001111000111010101101110100001110101010001110101111101000111010111111111010101011001111011100111001111000111010110001110111001110101010001110101010101000111010110000111100011110000111011000111001111000111010110001110110111010000101101000 f0c08ebff0a4cce78eb1dcea8eaa8eb0f1e1d8e78eadd0ea8ebe8ebfeab3dce78eb1dcea8eaa8eb0f1e1d8e78eb1dba168

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
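
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (cp932 for SJIS-Win, cp949 for UHC), the helper names encode_row and convert, and the use of errors="replace" to stand in for the "?" (0x3f) bytes that appear for unencodable characters are all assumptions.

```python
# Map the charset labels used in the table to Python codec names.
# These mappings are assumptions; the page's own backend is not known.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("あh").items():
        print(label, bits, hexa)
```

Because each byte is formatted as exactly eight bits, the binary column is always four times as long as the hexadecimal column, as in the table above.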